A PROPOSED MECHANISM FOR ICE FRICTION
C. D. NIVEN 


ABSTRACT 


The collapsible structure of ice is stressed. The adhesion bond is visualized 
as being associated with a readjustment of a partially collapsed or distorted mole- 
cular structure when cooling sets in after plastic flow has occurred in the asperity. 
With this general idea in view the friction mechanism on ice is compared in 
detail with that on a non-collapsible material, and explanations for observations 
are suggested. 


INTRODUCTION 


In previous communications (Niven 1954, 1956, 1956, 1958) the results 
of some measurements of ice friction carried out in the cold room have been 
described. These investigations were undertaken in order to acquire some 
information on obscure points, such as the effect of high loading, the effect 
of heating the runner, the effect of speed, and the behavior of rubber. In the present
communication an attempt is made to fit the results into the modern adhesion 
theory of friction which explains the behavior of metals. 

Up to the present time two ideas have dominated the thinking of engineers 
and scientists in connection with ice friction: one idea is that pressure melting 
supplies a film of water on the surface: the other idea is that frictional heating 
supplies that film (Bowden and Hughes 1939). The two ideas have a point 
in common, namely, that if melting can be accounted for, the low friction of 
ice is explained and the problem is solved. Researchers have accepted this 
assumption uncritically: for instance Klein (1950) tried to see the water by 
using a window on the ski: Bowden and Hughes (1939) used an electrical 
method to prove the presence of water on the ice surface: McConica (1951) 
suggested the possibility of the adsorption of water molecules or “hydrol”
by the ice: Weyl (1951) suggested that the surface of the ice was liquid owing 
to polarity adjustments in the layers beneath. 

But to merely account for a film of water on the surface of the ice is far 
too superficial an approach to the ice friction problem because, apart altogether
from the low friction value, ice friction violates Amontons’ law at high
loading; it violates very seriously the approximate law that friction is independent
of temperature; and it violates to a small extent the other approximate
friction law that friction is independent of speed, particularly at low
speeds. Before a substance stands out as being so exceptional there must 
be something fundamentally different about it. Any attempt therefore to 
explain ice friction entirely by a frictional heating mechanism is unsatisfactory 
because this implies that all hard substances which melt should have ice- 
like friction qualities just below their melting points. But the following more 
specific objection to the frictional heating explanation was noted by McConica 
(1950): a corollary of that explanation is that sliders with high thermal conductivity
should have high friction: McConica points out that magnesium,
which is a good heat conductor, has a low friction. 

The other and older theory—the pressure melting idea of Reynolds—has 
been criticized and practically rejected because the pressures necessary to 
cause liquefaction are very high. However, this theory has a point to recom- 
mend it, in that it picks out a unique quality of water—namely its expansion 
on freezing—and makes the theory depend on this peculiarity: indeed, some 
knowledgeable scientists still cling to this theory on the grounds that the 
pressure on the tip of an asperity is great enough to liquefy a tiny quantity 
of water: but this implies that the area of real contact is very small indeed 
so that this explanation is not satisfactory. 

Wide differences in the friction values of the various substances on ice suggest 
that one must go beyond the macroscopic physical ideas of melting by pressure 
or heat and imagine what may be going on at the surface of the ice on an 
atomic or molecular scale—in other words, apply to this problem some 
information available about the structure of water and ice. 


THE EFFECT OF HIGH LOADING AND OF HEATED RUNNERS ON ICE 
FRICTION 

The writer’s attention was drawn to the effect of high loading by Mr. W. E. 
Wakefield of the Forest Products Laboratory in Ottawa, who stated that after 
a sleigh was very heavily loaded the drag increased very little if the load was 
increased. In his report Wakefield (1938) merely emphasized that friction 
drops at high loading, which is really an understatement of a very remarkable 
phenomenon, because in mild winter temperatures laboratory tests (Niven 
1954) showed that drag was practically independent of load at high loadings: 
in severe winter temperatures the results indicated that far higher loads would 
be required for the drag to be constant with load. Figure 1 illustrates these 
findings and simply forces one to the conclusion that the pressure on the ice 
cannot be ignored in any sound explanation of ice friction. Amontons’ law— 
virtually the king-pin of friction theory for ordinary substances—does not 
hold good for ice if the pressure is high and the temperature moderate. 

The work on the heated runners was carried out to see if they had a lower 
friction on ice than unheated runners. If a film of water on the ice surface 
was the explanation of low friction, it was expected that heating the runner
would assist the formation of the film and so reduce the friction. Pfalzner
(1947) and the writer (Niven 1956) were independently forced to the same 
conclusion—namely that heating the runner reduced the friction by only a 
very slight amount. The experiments were disappointing but the fact that a 
slight reduction could be observed under certain circumstances e.g. at lower 
temperatures, rather supported than refuted the frictional heating explanation
of ice friction.

Fig. 1. Drag vs. load curves at -2° C and -17° C for a miniature stainless steel sleigh
with an apparent contact area of 1.6 sq. cm.


These tests at high loading and with heated runners suggested that if the 
explanation of ice friction were merely a problem of accounting for a film of 
water, the pressure-melting idea and the frictional-heat idea were both partly 
right. Unfortunately the basic idea of lubrication by means of the film of water 
seems open to criticism, because water is certainly not a good lubricant in 
general and in contact with a cold surface it is an adhesive. The idea that the 
liquid film reduces friction seems to have crept in because the idea of mole- 
cules gliding past each other in the liquid state was quite familiar. If for some 
reason ice happened to be a solid in which the molecules could turn slightly 
or swivel in groups then friction on such a solid might be low regardless of the 
presence of the liquid film. However, in the meanwhile let us consider the 
effect of a liquid film in contact with the cold surface. 

The film acts as an adhesive and therefore anything which tends to produce 
it also tends to create a condition whereby the slider can adhere to the surface. 
The water film, if it does exist at all, must then perform two functions in the 
friction mechanism—firstly, it must reduce friction by creating a condition 
whereby molecules can easily slip past each other and secondly, it must aug- 
ment friction by creating a condition whereby one surface can adhere to the 
other. It is these two conflicting tendencies which complicate the explanation 
of ice friction: but before pursuing this point farther, the sliding process of 
metals which obey the normal friction laws must be very briefly considered.


THE SLIDING FRICTION MECHANISM FOR NORMAL METALS
COMPARED WITH THE MECHANISM FOR ICE 

Sliding friction is mainly an adhesion phenomenon with a certain amount 
of ploughing action in the surface. Pressure and rubbing cause the tips of the 
asperities to weld together in the case of the normal metals and the force 
required to shear these welds is what accounts for the frictional force accord- 
ing to modern theory. When the two surfaces are not both metallic it seems 
necessary to assume other kinds of bonds than the metallic bond: the strength 
and type of these bonds presumably depend on the electronic structures of the 
surface atoms. 

To fully appreciate what is happening during the sliding mechanism it is 
well to break up this general description of sliding friction into five events, 
which must be considered to blend into each other as the one asperity passes 
the other. 

The first event is a collision between an asperity projecting down from the 
slider and an asperity projecting up from the surface, resulting in a drastic 
pushing of molecules in one or both asperities, accompanied, of course, by a 
stretching and straining of bonds—in fact, a plastic flow. This amounts to 
saying that heat is produced. 

The second event in the mechanism is that the two colliding asperities take 
a greater share of the load on the slider and, over the little area of real con- 
tact formed by the plastic flow, molecules of the surface are brought into close 
contact with molecules of the slider—pressed against them and rubbed by 
them. 

The third event is that the surfaces adhere—in the case of metals, weld 
together. 

The fourth event is that plastic flow occurs in one or both of the asperities 
as the forward movement of the slider causes the one asperity to drag the 
other along with it. 

The fifth event is that the asperities separate possibly with a piece of one 
of them torn out and sticking on to the other. 

If the main part of the frictional force is accounted for by the shearing of 
the welds, this means that frictional heating is to be accounted for by the 
plastic flow and breaking of bonds after the welding process. However, since 
Amontons’ law for ordinary metals is explained by the dependence of the 
area of real contact on the load for materials which deform plastically and 
since the weight is continually being carried by new asperities as the slider 
moves forward, some plastic flow and therefore heating occurs before welding: 
therefore there is expenditure of energy before as well as after welding, or in 
other words part of the frictional force is employed in the pushing of one 
asperity by the other. This apparently small detail is of considerable im- 
portance in explaining ice friction. 
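
The load dependence invoked here can be written compactly. What follows is the standard statement of the adhesion theory of friction and is not taken from this paper; the symbols $A$ (area of real contact), $W$ (load), $p$ (yield pressure of the softer material), $s$ (shear strength of the junctions), $F$ (frictional force), and $\mu$ (coefficient of friction) are introduced only for illustration.

```latex
A = \frac{W}{p}, \qquad
F = sA = \frac{s}{p}\,W, \qquad
\mu = \frac{F}{W} = \frac{s}{p}
```

Because $\mu$ contains neither the load nor the apparent area, Amontons' law follows for asperities that deform plastically. This simple form leaves out the ploughing term and the energy spent in pushing the asperities before welding, which is the part the text goes on to treat as important for ice.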

The structure of ice is an open structure (Bernal and Fowler 1933) consisting
of an atom of oxygen surrounded by four other oxygen atoms held
to the central atom by hydrogen bonds and forming a tetrahedron around it.
This structure differs basically from another well-known tetrahedral structure
often applying to metals, namely the close-packed structure of the hexagonal
or face-centered cubic types, where the central atom is missing and the four 
atoms forming the tetrahedron nest snugly into each other. 

To appreciate then the difference between ice friction and the friction of 
quite a number of ordinary metals, one must compare an asperity composed 
of atoms or molecules in the open tetrahedral structure with an asperity having 
the close-packed structure. When the close-packed structure is hit by a slider 
asperity, atomic disarrangement occurs and this may be regarded as a ten- 
dency to melt the surface asperity: pressure, however, tends to keep the 
molecules nested in place and this may be regarded as a tendency to solidify 
it. On the other hand when the ice asperity is hit by a slider asperity the 
molecular disarrangement which occurs is accompanied by a tendency to 
make the open structure collapse and get into a structure more like that of 
the water structure: this amounts to saying that when the open-structured 
ice asperity is hit, agitation and pressure co-operate in tending to melt the 
asperity. With the close-packed asperity they work against each other and 
welding, instead of the tendency to melt, results. 

If the load on the slider were so great that agitation and pressure com- 
pletely liquefied the asperity then no further increase in pressure should cause 
any more melting, and drag should remain constant. This condition would 
most easily arise at temperatures near to 0° C when the structure should 
collapse more easily than in severe temperatures (see Fig. 1). 

Since water retains its tetrahedral structure after melting occurs, a more 
accurate description of the pressure-melting mechanism is a swivelling of 
open tetrahedra held together by electrostatic bonds. When the pressure is 
large enough to completely “crush” the ice structure into the liquid tetrahedral
structure, no further pressure, within reasonable limits, can do any
more crushing: for some reason this structure is mobile. This, of course, was 
essentially the point which Reynolds intuitively perceived long before the 
tetrahedral structure of water was heard of. The present explanation of ice 
friction assumes that in between the completely “crushed” structure applicable
to liquid water and the uncrushed or unloaded structure applicable to ice 
there is a continuous series of structures in which the tetrahedra have swivelled 
to a greater or less extent depending on the load. That is why throughout the 
above discussion the formation of water of lubrication has been referred to 
conditionally. The process during the friction mechanism which is usually 
referred to as the formation of water of lubrication is visualized by the writer 
as a swivelling of tetrahedra to form a structure more like the water structure— 
a tendency to liquefaction rather than a complete liquefaction. The suggestion 
is that this partial swivelling of tetrahedra—somewhat reminiscent of a 
ball moving in a ball race—reduces the friction without the actual production 
of a water film as a result of complete swivelling. 

Returning now to the general friction mechanism for normal metals, after 
the agitation due to collision of the two asperities has taken place the slider 
asperity starts to leave the surface asperity. At this stage the welds which 
have been formed by agitation and pressure start to shear. But with ice the
situation is quite different because the welds have not been formed: in fact,
the slider asperity must be visualized as being in contact with a spot of ice 
surface which is to be regarded as having a distorted tetrahedral structure 
more applicable to water than the undistorted ice structure was. The surrounding
temperature is below 0° C and so “freezing” begins, that is to say the distorted
structure returns to the normal ice structure. It is at this juncture that
the question of whether or not the slider is composed of a material to which 
ice will adhere becomes highly important, but before discussing the adhesion 
bond, let us continue to concentrate attention on the freezing process which 
hardens up the ice asperity immediately after it has been subjected to mole- 
cular disarrangement and pressure. 

If the temperature of the main body of the ice and of the slider is well below 
0° C the freezing process will be rapid and the bond will be strong. Therefore 
ice friction should be highly dependent on temperature—this agrees with 
observation. The rapidity with which the asperity will harden up should be 
proportional to the temperature difference of the asperity and the ice: in 
other words the contribution to the friction made by the adhesion bond should 
depend on the number of degrees the ice temperature is below 0° C. 

Furthermore since the formation of the adhesion bond is due to heat loss 
by conduction, it is to be expected that the bond will be poor if the time 
during which heat loss can take place is short, that is to say if the slider is 
moving quickly. Observation (Niven 1956) shows that ice friction drops 
slightly with increase of speed—particularly at low speeds. 

The mechanism for ice friction described above provides then for the observations
that ice disobeys all the laws of friction: Amontons’ law at high
loading and also the two approximate laws regarding independence of temperature
and speed.

For the sake of clarity only a close-packed structure was discussed above in 
comparing the normal friction mechanism with the ice friction mechanism 
but the argument applies not only to a close-packed structure but to any 
structure whose molecules tend to nest closely into each other and keep their 
arrangement when compressed. This, incidentally, amounts to saying that the 
argument does not apply to those substances which expand on freezing, and 
so the suggestion is that any solid which expands on freezing should have 
friction qualities to some extent like those of ice and different from normal 
substances. A hint that this conception may be correct is an observation by 
Bowden and Freitag (1955) that bismuth has a low friction coefficient com- 
parable with that of ice. 


THE ADHESION BOND 


When metal slides on metal the adhesion bond must be a metallic bond 
because Bowden (1950) and his collaborators have shown that the bonds of 
the weld can be as strong as the bonds which hold the atoms together in the 
metal. The adhesion bond when ice freezes on various substances cannot, 
however, be specified nearly so precisely. 

According to de Bruyne (1951) the “first rule of adhesion” is the use of
“an adhesive that ‘wets’ the surface”: therefore substances which can be wet
by water can be expected to bond with ice. Bowden (1953) showed that 
plastics which do not wet easily have a low friction on ice at low temperatures 
and thus firmly established the experimental basis for an adhesion theory 
of ice friction: this constituted a tremendous advance in understanding the 
nature of the ice friction mechanism. 

Bernal and Fowler (1933) have explained that the hydrogen bonds holding 
the oxygen atoms together in the ice structure are essentially electrostatic in 
character, which suggests a possibility that when ice freezes on to some 
substance the bond may be electrostatic. Weyl (1951) visualizes under the 
surface of the ice a gradual adjustment of the bonding for several hundred 
layers of molecules, to give a surface structure like that of water, and he 
points out that just as soon as the surface is covered, forces may be supplied 
at the surface which permit readjustment to the normal bonding of the 
interior of the ice. Wood absorbs water and supplies therefore the right type 
of bonding—presumably electrostatic. Teflon, on the other hand, which is 
hydrophobic and incidentally very resistant to chemical combination in general, 
apparently fails to supply the right type of forces to permit readjustment of 
the bonding below the surface of the ice, and so little adhesion occurs. Why 
bakelite is the only other organic material with a friction value comparable 
with that of teflon and why a material like polystyrene has a considerably 
higher friction value (Niven 1956) than either of these are questions dealing 
with fine points in the adhesion theory of friction which as yet remain un- 
answered. Since McConica (1950) found that the polarity of a ski wax was 
important in assessing its friction on snow, polarity may be a factor in 
explaining the behavior of the various plastics. 

Metals, with their highly mobile valency electrons, are easily polarized: 
accordingly one can expect high friction between metal and ice at low tempera- 
tures unless the metallic surface has a protecting layer of oxide, difficult to 
polarize. 

The adjustment of the bonding under the surface of the ice when covered 
by some substance having suitable forces, strictly speaking explains static 
friction more obviously than sliding friction, because, when a substance is 
sliding on ice, the molecules of the ice asperity are in a state of agitation and 
disarrangement before bonding occurs. But any substance which can form 
the bond by readjusting the forces under the surface will also form bonds with 
a resolidifying asperity in which the forces are in a state of disarrangement: 
therefore, one might expect that substances with high static friction on ice 
should also in general have high sliding friction. 

The application of the adhesion theory of friction to rubber on ice presents 
a problem of special interest, because rubber is non-polar and should therefore 
have poor adhesion to ice: experiment shows, however, that its friction on ice 
is very high. Admittedly rubber requires special theoretical treatment (Denny 
1953) to explain its friction on surfaces in general, on account of its exceptional 
elastic qualities. It has a high friction on dry surfaces: it can conform to the 
shape of the surface and thereby a weak adhesion bond can be multiplied more
often than is possible for a hard surface making contact only at the tips of
asperities. Furthermore, because the rubber can shape itself over and around 
the ice asperity, it can, so to speak, grip any roughness—like the cogs on wheels 
interlocking. This grip may be somewhat like the grip which McConica (1950) 
describes as a “mechanical attachment” in discussing the behavior of plastic
ski waxes on snow. However, no matter whether this grip on ice is a true 
adhesion or a mechanical attachment, it must let go if the asperity should 
melt. This explains some results obtained by the writer (Niven 1958) on 
rolling friction of rubber tires. At -1° C the rolling friction was quite appreciable,
indeed comparable with the friction at -3° C, but at temperatures
very close to 0° C it was small, indefinite, and independent of load at higher
loadings. 


CONCLUSION 

While our knowledge of the adhesion bond between ice and the various 
substances is still imperfect, enough is known to indicate that adhesion plays 
a very important part in the ice friction mechanism. But adhesion is probably 
not the whole explanation because at temperatures just below 0°C the 
adhesion of the slider material to ice is of relatively small importance: both 
stainless steel and teflon slide excellently at these temperatures. It is at the 
colder temperatures that adhesion appears to become highly important: steel 
has then a far higher coefficient of friction than teflon. An idea somewhat 
like the old asperity theory of friction has been used in the above discussion 
to explain how the asperity is “prepared” by agitation and pressure for the
adhesion bond to form by cooling. At temperatures only slightly below 0° C 
when conduction of heat from the asperity to the main body of the ice is very 
small, this adhesion or “refreezing” bond should be almost negligible and so
the contribution to the frictional force made by one asperity pushing the other 
when they collide may quite reasonably be expected to become important. 

Anything which helps to weaken the bonding in the ice so that the slider 
asperity can push forward after collision may be expected to lower friction, 
during the early stages of the mechanism: therefore, high temperature, high 
loading, and high impact (speed) should reduce friction. Anything which 
helps the bonding to occur between the slider asperity and the ice may be 
expected to increase friction, during the later stages of the mechanism: there- 
fore, low temperature, low speed, and a slider substance which will wet or 
polarize easily should increase friction.


ABSOLUTE STANDARDIZATION OF RADIOACTIVE
NEUTRON SOURCES 


I. ACTIVATION OF MANGANESE BATH
K. W. GEIGER AND G. N. WHYTE


ABSTRACT 


The neutron emission rate of a Ra-α-Be source of the National Research
Council has been measured by moderating the neutrons in manganese sulphate 
solution surrounding the source. Some of the thermal neutrons captured in the 
bath produce manganese-56, and the neutron emission rate is found from an 
absolute determination of the resulting activity. A number of corrections are 
discussed in detail. The value of (3.22 ± 0.05) × 10⁶ neutrons per second found
for the neutron emission rate shows satisfactory agreement with the results of 
our intercomparison with the primary neutron standard of the National Bureau 
of Standards, and allows a tie-in with a series of international comparisons of 
primary neutron sources. 


INTRODUCTION 


For the determination of the emission rate, Q, of a radioactive neutron 
source three principal methods are available: 

(1) The source neutrons can be thermalized in a moderator and counted 
absolutely; an integration over the volume of the moderator yields the 
total emission rate. 

(2) Charged particles, associated with the neutron-producing reaction in 
the source, can be counted absolutely. 

(3) The source and an absorber of known neutron absorption cross section 
can be introduced into a nuclear reactor and neutron emission and 
absorption balanced. 

There are also a number of more indirect ways of determining the neutron 
emission rate of a source, such as by comparison with the neutron emission 
rate from an accelerator-induced reaction in which the associated particles 
can be counted absolutely. The T³(d, n)He⁴ reaction is commonly used for
this purpose. 

The present paper (I) describes a standardization using the first method,
applied to the standard Ra-α-Be neutron source of the National Research
Council (N.R.C.). A later paper (II) will deal with the standardization of a
Po-α-F source by the second method.


PRINCIPLE OF THE MEASUREMENT 
The measurement is based on the slowing down and capture of neutrons 
in a bath of aqueous manganese sulphate solution (O’Neal and Scharff-Goldhaber
1946; Alder and Huber 1949; DeJuren and Chin 1955a). If the
bath is large enough, practically all the neutrons emitted by a source which
is in the center are slowed down to thermal energies and captured, either by 
hydrogen or by manganese. From the ratio of the capture cross sections of 
manganese and hydrogen, one can calculate the fraction of the total captures 
that occur in manganese. Since manganese has only one stable isotope, Mn⁵⁵,
each capture in manganese results in the formation of a nucleus of Mn⁵⁶,
which decays with a half life of 2.58 hours. This can be readily detected
through either its β- or γ-rays. If the neutron source is left in the bath until
the manganese activity has built up to equilibrium, the number of manganese
atoms disintegrating per second in the bath is equal to the number of neutrons
captured per second by manganese nuclei, $Q_{\mathrm{Mn}}$. If the total activity of Mn⁵⁶
can be measured, the rate of neutron emission of the source is given by:


(1)   $Q = \dfrac{(1+\alpha)\,N_{\mathrm{Mn}}\sigma_{\mathrm{Mn}} + N_{\mathrm{S}}\sigma_{\mathrm{S}} + N_{\mathrm{H}}\sigma_{\mathrm{H}}}{(1+\alpha)\,N_{\mathrm{Mn}}\sigma_{\mathrm{Mn}}}\; Q_{\mathrm{Mn}}$


where $N_{\mathrm{Mn}}$, $N_{\mathrm{S}}$, and $N_{\mathrm{H}}$ are the numbers of manganese, sulphur, and hydrogen
nuclei per cm³, and $\sigma_{\mathrm{Mn}}$, $\sigma_{\mathrm{S}}$, and $\sigma_{\mathrm{H}}$ are the corresponding thermal-neutron
capture cross sections. The factor $1+\alpha$ arises because the manganese cross
section does not vary exactly as 1/v in the epithermal region. A calculation
following the method of DeJuren and Chin (1955a) shows that at the concentration
used here the manganese captures 1.7% more neutrons than would
a perfect 1/v absorber of the same thermal cross section. The factor $1+\alpha$
thus amounts to 1.017.
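
As a rough numerical illustration of equation (1), the sketch below evaluates the bracketed ratio for a solution of 30.31% MnSO₄ by weight, the concentration quoted in this paper. Only ratios of nuclei enter the equation, so the number densities can be replaced by moles per gram of solution and the density is not needed. The cross sections and molar masses are assumed round values that are not taken from this paper, and the function names are purely illustrative.

```python
# Sketch of equation (1): ratio of total captures to captures in manganese.
# Cross sections and molar masses are ASSUMED round values, not from the paper.

SIGMA_MN = 13.3   # barns, assumed thermal capture cross section of manganese
SIGMA_S = 0.52    # barns, assumed thermal capture cross section of sulphur
SIGMA_H = 0.332   # barns, assumed thermal capture cross section of hydrogen
M_MNSO4 = 151.0   # g/mol, assumed molar mass of MnSO4
M_H2O = 18.015    # g/mol, assumed molar mass of H2O


def relative_nuclei(weight_fraction_mnso4):
    """Return (n_Mn, n_S, n_H) in moles per gram of solution."""
    n_salt = weight_fraction_mnso4 / M_MNSO4
    n_water = (1.0 - weight_fraction_mnso4) / M_H2O
    # One Mn and one S per MnSO4, two H per H2O.
    return n_salt, n_salt, 2.0 * n_water


def capture_ratio(weight_fraction_mnso4, one_plus_alpha=1.017):
    """Bracketed factor of equation (1), i.e. Q / Q_Mn."""
    n_mn, n_s, n_h = relative_nuclei(weight_fraction_mnso4)
    mn_term = one_plus_alpha * n_mn * SIGMA_MN
    total = mn_term + n_s * SIGMA_S + n_h * SIGMA_H
    return total / mn_term


def emission_rate(q_mn, weight_fraction_mnso4=0.3031):
    """Neutron emission rate Q from the manganese capture rate Q_Mn."""
    return capture_ratio(weight_fraction_mnso4) * q_mn


if __name__ == "__main__":
    print(f"Q / Q_Mn = {capture_ratio(0.3031):.3f}")
```

With these assumed inputs roughly half of the captures occur in manganese, so Q comes out at about twice Q_Mn. Capture in oxygen is neglected here, as it is in equation (1).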

When the solution is stirred, a mechanical integration of the induced 
activity over the whole volume of the bath is performed and the absolute 
activity of a known fraction of the solution yields the number of neutrons 
per second captured by manganese. However, the dimensions of the bath 
must be large to ensure that most of the neutrons are slowed down and 
captured. As a result most of the solution is exposed to only a very low thermal- 
neutron flux, and the average specific activity is too low to be measured 
directly. Instead a relative measurement of the activity is then made by 
inserting a radiation detector into the center of the bath. In order for the 
reading to be interpreted absolutely, the detector must be calibrated by 
observing its response when a known quantity of active manganese is mixed 
into the bath, so as to determine the number of disintegrations per second 
of Mn⁵⁶ in the bath corresponding to 1 count per minute in the immersed
detector. 
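
The calibration step described above amounts to a single proportionality constant. The sketch below shows the arithmetic under the assumption that the detector response is linear in the bath activity; the function names and the numbers in the usage lines are hypothetical and are not measured values from this paper.

```python
# Sketch of the detector calibration: known activity in, count rate out.
# All numerical inputs in the demo below are HYPOTHETICAL.

def calibration_factor(known_activity_dps, net_count_rate_cpm):
    """Disintegrations per second in the bath per count per minute observed."""
    if net_count_rate_cpm <= 0:
        raise ValueError("net count rate must be positive")
    return known_activity_dps / net_count_rate_cpm


def bath_activity(net_count_rate_cpm, factor):
    """Total bath activity (dps) implied by an observed net count rate."""
    return factor * net_count_rate_cpm


if __name__ == "__main__":
    # Hypothetical calibration run followed by a hypothetical source run.
    k = calibration_factor(known_activity_dps=3.3e5, net_count_rate_cpm=5000.0)
    print(k, bath_activity(20000.0, k))
```

The count rates passed in are assumed to be net rates, with background already subtracted and decay corrections already applied.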


APPARATUS AND EXPERIMENTAL PROCEDURE 

The manganese sulphate bath was contained in an open cylindrical steel
tank, 85 cm in diameter, coated with neoprene to prevent corrosion. The
tank was filled to a depth of 85 cm with a nearly saturated aqueous solution
of manganese sulphate containing 30.31% MnSO₄ by weight. The exact concentration
was determined from the density of the solution and its tabulated
dependence on concentration (International Critical Tables 1928).

The source to be calibrated was the N.R.C. radium-α-beryllium source
No. N-200-1, containing 183 mg of radium element in the form of the carbonate
and 2000 mg of beryllium powder, pressed into a pellet to ensure physical
stability. The source was prepared in 1948. The pellet is contained in a
cylindrical monel capsule of 2 mm wall thickness, both the diameter and
the height being 19 mm.

In order to make a measurement the source was enclosed in a thin-walled 
polyethylene container and suspended in the center of the solution for a 
period of several hours. After removal of the source, the bath was stirred for 
15 minutes by means of a 1/8 horsepower electric mixer. 

The relative activity of the bath was measured by inserting a commercial
“multicellular” Geiger-Müller tube, protected by a brass shell, into the
center of the solution. This counter consisted of several bismuth-cathode
tubes in parallel, enclosed in an outer envelope 30 cm long by 5 cm in diameter,
which combines high γ-efficiency with short dead time. The response of the
counter was checked periodically with a small radium source and found to
be stable to within better than ±0.3%.

Five or more 10-minute counts were taken and the rates corrected for 
decay of Mn⁵⁶ back to the time of source removal. The published half life
(DeJuren and Chin 1955a) of 2.58 hours was used, and this fitted the data
within the statistical uncertainties. Two runs over several half lives also
confirmed this figure and showed that no other radioactive γ-emitter was
produced in the bath. Since irradiation with the source took place for only 
approximately 15 hours, a further small correction had to be made to obtain 
the counting rate for irradiation to equilibrium. 
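
The two corrections in this paragraph, for decay back to the time of source removal and for irradiation short of equilibrium, can be sketched as follows. The 2.58 hour half life is the value quoted in the text; the function names are illustrative, and decay during each 10-minute counting interval is ignored in this simplified form.

```python
import math

HALF_LIFE_H = 2.58                          # hours, half life quoted in the text
DECAY_CONST = math.log(2.0) / HALF_LIFE_H   # decay constant, per hour


def decay_corrected_rate(rate, hours_since_removal):
    """Count rate extrapolated back to the time of source removal."""
    return rate * math.exp(DECAY_CONST * hours_since_removal)


def saturation_fraction(irradiation_hours):
    """Fraction of the equilibrium activity reached after a finite irradiation."""
    return 1.0 - math.exp(-DECAY_CONST * irradiation_hours)


def equilibrium_rate(rate_at_removal, irradiation_hours):
    """Count rate that irradiation to equilibrium would have produced."""
    return rate_at_removal / saturation_fraction(irradiation_hours)


if __name__ == "__main__":
    print(f"saturation after 15 h: {saturation_fraction(15.0):.4f}")
```

For a 15 hour irradiation the saturation fraction is about 0.98, so the correction to equilibrium is a little under 2%, consistent with the text calling it small.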


THE SZILARD-CHALMERS PROCESS AND ABSOLUTE STANDARDIZATION
OF Mn⁵⁶

To calibrate the counter in the bath a known amount of Mn⁵⁶ has to be
added to the inactive bath. The best method to determine the absolute Mn⁵⁶
activity is by 4π β-counting; however, the specific activity of the Mn⁵⁶ solution
has to be high to make self-absorption of the source deposit in the 4π counter
negligible. This was achieved by using the Szilard-Chalmers (1934) process.

Approximately 600 g of Ca(MnO₄)₂ in 900 ml H₂O were irradiated overnight
with a 1 g Ra-α-Be neutron source. The activated manganese leaves
the permanganate ion by recoil due to neutron capture γ-rays and forms
insoluble MnO₂, which was separated by filtering through a glass filter,
medium grain. The active MnO₂ residue was dissolved in 8 ml of 6 N HCl. A
few drops of H₂SO₄ were added and the solution evaporated to dryness until
no more SO₃ vapors escaped. The few milligrams of resulting MnSO₄ were
dissolved in H₂O, filtered, and made up to 50 ml. This represented the stock
solution. The Szilard-Chalmers process yielded about 60% of the available
active manganese and an enrichment in Mn⁵⁶ isotope by a factor of 5000.

Three samples of 0.05 ml were pipetted for 4π counting. The samples were 
prepared following the method of Pate and Yaffe (1955) on gold-plated VYNS 
foil (approximately 5 µg/cm² foil and 15 µg/cm² Au) using insulin as wetting 
agent. Drying was carried out in an NH₃ atmosphere under infrared light. This 
produced a very finely grained MnO₂ deposit which was about 100 µg/cm² 
thick. 

The 4π counter was operated in the proportional region under continuous 
CH₄ gas flow. Counting plateaux with no statistically significant variation 
could be obtained over a range of 200 volts by using a non-overloading pulse 
amplifier of 0.2-mv input sensitivity. Since the dead time of the amplifier was 
5 µsecond, resolving time corrections were negligible for the maximum counting 
rates (8000 c.p.m.) which occurred. 

Absorption in the source mount and in the active deposit should be negligible 
at the thicknesses used (Baker and Katz 1953; DeJuren and Chin 
1955a). It is estimated that the uncertainty of the standardization of Mn⁵⁶ is 
±1%, mainly due to pipetting errors and non-uniformity of the source 
deposit. 

The 4π counting yielded the activity per milliliter in the stock solution. The 
rest of the solution was stirred into the MnSO₄ bath, and the response of 
the multicellular counter to the known Mn⁵⁶ activity was observed. This 
calibrated the counter in the bath. Such calibration of the counter-bath 
assembly was made before each activation of the bath by the neutron source 
to be standardized; the actual value was around 66 Mn⁵⁶ disintegrations per 
second for one count per minute as recorded by the multicellular counter. This 
figure changed very little for the various runs. The neutron emission rate 
from the source was then calculated using Equation 1 with the cross-section 
values as listed in Table I. With a MnSO₄ concentration of 30.31% the 
capture rate becomes 


$Q = 1.985\, Q_{\mathrm{Mn}}$. 


Since nearly half of the total neutrons are being captured in manganese the 
contribution of the cross-section error in the final result is reduced by a 
factor of two. 


TABLE I 

Thermal neutron absorption cross sections 

         Cross section, barns       Reference 
H¹       0.332±0.002                Hughes and Schwartz 1957 
S        0.49                       Hughes and Harvey 1955 
Mn⁵⁵     13.25±0.2                  Hughes and Harvey 1955 
         (Best estimated value)     DeJuren and Chin 1955b 
                                    Cummins and Spurway 1957 


CORRECTIONS 
If no neutrons were lost by other processes, the capture rate, Q, in the 
bath could be equated to the emission rate of the source. Actually small 
losses of several kinds occur, and these must first be taken into account. 


Fast-Neutron Capture 

At energies above a few million electron volts, neutrons can undergo nuclear 
reactions with the oxygen and sulphur present in the bath. The neutrons that 
interact in this way do not reach thermal energies and hence do not contribute 
to the activation of the manganese. This loss of fast neutrons must be 
estimated and included in the total emission rate of the source. 

The fast-neutron loss in water for the case of a Ra-α-Be source has been 
determined experimentally by de Troyer and Tavernier (1954), who compared 
the slowing-down density of 1.44-ev neutrons in water and paraffin oil (which 
contains no oxygen). Their results indicate that (2.2±0.3)% of the neutrons 
emitted by the source were absorbed at high energies by the oxygen in the 
water.¹ In the absence of similar measurements in MnSO₄, the additional 
absorption in the sulphur can only be estimated from a comparison of the 
sulphur- and oxygen-absorption cross sections over the limited range of energies 
for which they have been measured. If the resonance escape probability in 
water is put equal to 0.978 on the basis of the measurements of de Troyer 
and Tavernier, an average absorption cross section in oxygen between 4 Mev 
(below which it is known to be negligible) and 13 Mev (the upper limit of 
the Ra-α-Be neutron spectrum) can be deduced; the result is 0.2 barns. The 
(n, p) and (n, α) cross sections in sulphur have been measured between 2 
and 4 Mev (Hughes and Harvey 1955); their sum increases from 0 to about 
0.45 barns in this region. If, therefore, the average cross section between 2 
and 13 Mev is assumed to be between 3 and 4 times the oxygen cross section, 
the total fast neutron absorption in the MnSO₄ solution amounts to 
(2.8±0.4)%. 


Absorption in the Neutron Source 

A correction must also be made for the absorption of thermal neutrons in 
the source itself. De Troyer and Tavernier (1954) have determined the 
amount of this absorption by measuring the effect of additional absorbing 
material placed around the source on the neutron density integrated over 
the volume of the bath. For the case of a source with a total absorption cross 
section of 1.0 cm² in a bath of pure water they found the absorption to be 
(0.75±0.3)% of the emitted neutrons. The effect of the manganese in the 
bath used here is to reduce the thermal-neutron flux at any point by a factor 
of roughly 

$\Sigma_{\mathrm{H_2O}} / (\Sigma_{\mathrm{H_2O}} + \Sigma_{\mathrm{Mn}}) = 0.495$ 

from the value in pure water. (The Σ's are the macroscopic absorption cross 
sections.) The total absorption cross section of the present source was 1.16 
cm², so that the absorption in this case would amount to 

$0.75 \times 0.495 \times (1.16/1.0) = (0.4 \pm 0.2)\%$. 
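
The scaling used for this correction can be written out as a short calculation. This is a sketch only: the function name is hypothetical, and applying the same scaling to the quoted uncertainty is an assumption made here for illustration, not a step stated in the text.

```python
def scale_source_absorption(ref_percent, flux_factor, sigma_source_cm2, sigma_ref_cm2=1.0):
    """Scale a reference absorption (in %) by the flux reduction in the bath
    and by the ratio of source absorption cross sections."""
    return ref_percent * flux_factor * (sigma_source_cm2 / sigma_ref_cm2)


# Values quoted in the text: 0.75% for a 1.0 cm^2 source in pure water,
# flux factor 0.495 in the manganese bath, 1.16 cm^2 for the present source.
correction = scale_source_absorption(0.75, 0.495, 1.16)

# Assumption: the +/-0.3% uncertainty of the reference value scales the same way.
uncertainty = scale_source_absorption(0.3, 0.495, 1.16)

print(f"absorption in source: {correction:.2f}% +/- {uncertainty:.2f}%")
```

Rounded to one decimal place, this gives the 0.4% and 0.2% quoted above.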


¹De Troyer and Tavernier actually obtained a figure of 2.5%, but DeJuren (1958) pointed 
out a small error in their derivation of this figure. 

Escape 
A third way in which neutrons can be lost is by escape through the boundaries 
of the bath. The magnitude of this loss has been estimated in two ways: 
(1) Larsson (1954) has found that the thermal-neutron density around a 
source in a water bath, multiplied by the square of the distance from the 
source, falls off exponentially at large distances. On the assumption that 
the area under the portion of the exponential curve extending beyond the 
boundaries of the tank represents the number of neutrons escaping from the 
tank, it is possible to use this relationship to estimate the fraction of the 
neutrons escaping. On the basis of Larsson's data this fraction is given by 


$f = 1.5\, e^{-0.10\, r}$ 


where r is the distance from the source to the edge of the tank in centimeters. 
Taking an average of this expression over the boundaries of the tank used in 
the present experiment gives f = (1.4±0.4)%. Since Larsson found that the 
addition of a neutron absorber (H₃BO₃) to the water in the tank did not 
alter this fraction appreciably, the above figure should also apply to the 
MnSO₄ solution. (The slowing-down lengths in water and in the MnSO₄ 
solution do not differ by enough to affect this result.) 

(2) Rumsey and Ward (1948) have measured the total neutron capture 
in MnSO₄ baths of various sizes. Their data can be fitted by the following 
expression for the fractional escape: 


$f = 1.9\, e^{-0.105\, r}$ 


This also yields a correction of 1.4% in the present case. 


RESULTS AND DISCUSSION 
The results of seven runs, each consisting of a calibration of the counter 
in terms of disintegrations per second in the bath, followed by a measurement 
of the activity induced by the standard source, are as follows: 

Date         Q (neutrons/sec) 

7/5/58       3.22 × 10⁶ 
14/5/58      3.21 
23/5/58      3.21 
16/7/58      3.22 
23/7/58      3.19 
30/7/58      3.26 
8/8/58       3.23 

Mean         3.22 × 10⁶ 


The following are the estimated errors in the final result arising from various 
sources: 

Random counting errors                          0.3% 
Systematic errors in 4π counting                1.0 
Manganese concentration in bath                 0.3 
Capture cross sections                          0.8 
Correction for fast-neutron capture             0.4 
Correction for absorption in neutron source     0.2 
Escape correction                               0.4 
Resonance absorption in manganese               0.2 


The result for the emission rate of the source is 
Q = (3.22±0.05) × 10⁶ neutrons/second for July 1958. 
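
The quoted mean and overall uncertainty can be checked from the tabulated figures. The sketch below assumes that the individual errors are independent and combine as the root sum of squares; the text does not state the combination rule, so this is an assumption, and the variable names are illustrative.

```python
import math

# Emission rates of the seven runs, in units of 10^6 neutrons/sec.
runs = [3.22, 3.21, 3.21, 3.22, 3.19, 3.26, 3.23]

# Estimated percentage errors listed in the text.
errors_percent = {
    "random counting": 0.3,
    "systematic, 4-pi counting": 1.0,
    "manganese concentration": 0.3,
    "capture cross sections": 0.8,
    "fast-neutron capture correction": 0.4,
    "absorption in neutron source": 0.2,
    "escape correction": 0.4,
    "resonance absorption in manganese": 0.2,
}

mean_rate = sum(runs) / len(runs)

# Assumption: independent errors, combined in quadrature.
total_percent = math.sqrt(sum(e * e for e in errors_percent.values()))
abs_error = mean_rate * total_percent / 100.0

print(f"mean = {mean_rate:.2f} x 10^6 neutrons/sec")
print(f"combined error = {total_percent:.2f}% = {abs_error:.2f} x 10^6 neutrons/sec")
```

Under that assumption the combined error is about 1.5%, which agrees with the ±0.05 × 10⁶ quoted for the result.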


This can be compared with a value obtained by Chin and Caswell (1956), 
who compared the Canadian source with the National Bureau of Standards 
primary Ra-Be photoneutron standard (NBS I) in January 1956 using a 
manganese sulphate bath. After correction for fast-neutron capture in the 
case of the (α, n)-source a neutron emission rate of (3.13±0.09) × 10⁶ per 
second was obtained for the Canadian source. Corrected for growth of Po²¹⁰ 
its emission rate would be 3.16 × 10⁶ neutrons per second in July 1958. This 
is 2% lower than the present value, but still within the uncertainties of the 
measurements. With this comparison the N.R.C. source is tied into an 
international comparison of primary standards which was carried out by Larsson 
(1958). It is planned to have the Canadian standard compared directly with 
a number of other national neutron standards, most of which are of the 
Ra-α-Be type. 


FOUR-DIMENSIONAL ELECTROMAGNETIC RADIATORS¹ 


H. E. SHANKS AND R. W. BICKMORE 


ABSTRACT 


This paper describes an entirely new and fundamental technique which has 
wide application to many fields of the antenna art as well as important system 
implications. The essence of this philosophy is the utilization of the time domain 
as an additional variable with which to control antenna radiation characteristics. 
One way to accomplish this is by periodically time-modulating one or more of the 
antenna parameters (aperture excitation, aperture shapes, frequency, phase 
distribution, aperture size, etc.) in a prescribed way. The result of this modulation 
is a radiation pattern whose characteristics are periodically changing as a function 
of time. By virtue of the periodic nature of the pattern fluctuations, an infinite 
number of independent information channels are available corresponding to the 
harmonic frequency components of the modulated patterns. Since each harmonic 
will, in general, have a different space factor associated with it, proper data 
processing can be utilized to provide simultaneous operation of a single antenna in 
a variety of modes. In addition, this technique allows synthesis of antenna 
radiation characteristics which might be unobtainable by conventional methods. 


INTRODUCTION 

During the past 10 or so years the development of radiating systems has 
advanced to the point where a multitude of antenna techniques are available 
for any particular operational requirement. These techniques embrace the use 
of one-, two-, and even three-dimensional source distributions. However, even 
with all three spatial co-ordinates available, increasingly stringent system 
specifications cannot always be met. In other words, an additional degree of 
freedom—a fourth dimension—is needed to realize an order of magnitude 
improvement in antenna operation. The fourth dimension with which we are 
concerned here is just that so often mentioned in the popular literature—time. 

The essence of four-dimensional radiating systems is the periodic time 
modulation of one or more antenna parameters so as to increase the system 
information handling capacity or realize advanced radiation pattern charac- 
teristics. The modulated parameter can be the physical dimensions of the 
antenna, the energy distribution function over the antenna, the frequency or 
even the physical location of the phase center. An antenna technique investi- 
gated in the past (Cutrona 1955) is a practical example of the latter form of 
modulation. In this situation the antenna is physically moved and the received 
energy is processed to attain improved pattern characteristics. The necessity 
for processing is a function of the extremely low modulation rates (one or two 
cycles per second) involved. If the antenna modulation occurs at a much 
higher rate, processing takes place in a spatial sense and thus simplifies the 
system operation greatly. Application of these concepts to other parameters 
mentioned above results in exploitation of other pattern characteristics, for 
example, direction of beam collimation or sidelobe level. 

¹Manuscript received October 27, 1958. 

Contribution from the Microwave Laboratory, Hughes Aircraft Company, Culver City, 
California. 
Can. J. Phys. Vol. 37 (1959) 










264 CANADIAN JOURNAL OF PHYSICS. VOL. 37, 1959 


GENERAL REMARKS 


In its most general form an antenna may be described as a transducer 
between a spatial distribution of energy and a temporal distribution of infor- 
mation. Although this definition is largely academic, it will serve to point up 
the differences between three-dimensional and four-dimensional operation. 
Under this definition an antenna may be considered as a two-terminal pair 
network with normal operation restricted to stationary and linear transfer 
functions. However, with time as an additional degree of freedom, the equiva- 
lent network is no longer stationary and may in fact be non-linear, the output 
information thus being a function of the non-stationary (modulated) nature 
of the antenna as well as the input signal. 

If a wave of energy is incident on an antenna whose parameters are modulated 
in a periodic manner, the voltage across the output terminals will be of 
the following form, 

(1)  $E(\theta, t) = A\, g(t)\, \{b_0(\theta) + b_1(\theta) \cos \omega_A t + b_2(\theta) \cos 2\omega_A t + \ldots\}\, e^{j\omega t}$ 

where $A$ contains the radial dependence, $\theta$ denotes the spatial variation in the 
signal, and the Fourier series containing the $b_n(\theta)$ is the time-dependent 
radiation pattern with $\omega_A$ the fundamental modulation frequency. In (1), $g(t)$ 
represents the input information, which takes different forms depending on 
the operational use; for radar $g(t)$ is of a pulsed nature and contains the range 
information while in a communication link $g(t)$ might represent an audio 
voice spectrum. In any case the basic time-modulated character of the pattern 
information is identical. 

For general modulation functions the space factors $b_n(\theta)$ associated with 
the harmonically related frequency components $n\omega_A$ will be different. Because 
of the orthogonal nature of the Fourier terms each term may be considered 
as an independent information channel capable of carrying different spatial 
dependent information. The implication of this is clear; if an antenna is 
properly modulated, a multitude of spatial patterns is generated, each one 
associated with its characteristic frequency component. By proper processing 
of this complex signal, the individual patterns may be separated and each one 
utilized in a normal manner. It is thus obvious that the information capacity 
of a single antenna can be enormously increased by the use of time domain 
techniques. 
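
The separation of harmonic channels rests on the orthogonality of the cosine terms over one modulation period. As a minimal sketch, the code below projects a periodic signal onto each harmonic in turn and recovers the coefficients. The function name, the sample coefficients, and the modulation frequency are all hypothetical values chosen for illustration; the paper does not specify a processing scheme at this point.

```python
import math


def harmonic_coefficient(signal, n, omega_a, samples=2000):
    """Project one period of signal(t) onto cos(n * omega_a * t).

    Returns the n-th cosine Fourier coefficient, i.e. b_n when
    signal(t) = b_0 + b_1 cos(omega_a t) + b_2 cos(2 omega_a t) + ...
    """
    period = 2.0 * math.pi / omega_a
    dt = period / samples
    acc = 0.0
    for i in range(samples):
        t = (i + 0.5) * dt  # midpoint rule over one modulation period
        acc += signal(t) * math.cos(n * omega_a * t)
    scale = 1.0 if n == 0 else 2.0
    return scale * acc * dt / period


# Illustrative (hypothetical) space-factor values at one fixed angle theta.
b = [1.0, 0.5, -0.25]
omega_a = 2.0 * math.pi * 10_000.0  # assumed modulation frequency, rad/s


def received(t):
    """Time-varying pattern value at that angle, built from the b_n above."""
    return sum(bn * math.cos(n * omega_a * t) for n, bn in enumerate(b))


recovered = [harmonic_coefficient(received, n, omega_a) for n in range(3)]
print([round(x, 6) for x in recovered])
```

In a practical receiver this projection would be done by filters or coherent detectors rather than by numerical integration; the sketch only shows that each harmonic carries its own coefficient independently of the others.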

The preceding remarks present a qualitative picture of four-dimensional 
antenna operation. However, the question arises: How must the antenna be 
modulated to realize the proper spatial pattern $b_n(\theta)$? Consider a distribution 
of radiating sources spread over a surface $S_0$. The radiation pattern is 

(2)  $g(\theta) = \int_{S_0} W(s_0)\, G(\theta \mid s_0)\, ds_0$ 

where $W(s_0)$ is the complex distribution of energy over the surface, $G(\theta \mid s_0)$ is the 
Green's function characteristic to the surface $S_0$ and $\theta$ is the space variable. 
If $W(s_0)$ is allowed to vary with time, that is $W(s_0, t)$, then we desire the form 
of this variation to realize a prescribed set of $b_n(\theta)$'s in the radiation pattern 
$g(\theta, t)$. 

Under these conditions (2) may be rewritten with the time variation included as 

(3)  $g(\theta, t) = \int_{S_0} W(s_0, t)\, G(\theta \mid s_0)\, ds_0$ 

and if the variation is periodic, with frequency $\omega_A$, the distribution may be 
expanded 

(4)  $W(s_0, t) = \phi_0(s_0) + \phi_1(s_0) \cos \omega_A t + \phi_2(s_0) \cos 2\omega_A t + \ldots$ 

Substituting (4) into (3) results in 

(5)  $g(\theta, t) = \sum_{n=0}^{\infty} \left\{ \int_{S_0} \phi_n(s_0)\, G(\theta \mid s_0)\, ds_0 \right\} \cos n\omega_A t$ 

and if this expression is compared term by term with the time modulated 
pattern of (1), there obtains 

(6)  $b_n(\theta) = \int_{S_0} \phi_n(s_0)\, G(\theta \mid s_0)\, ds_0$. 

In other words, each harmonic coefficient $\phi_n(s_0)$ in the distribution gives rise 
to the corresponding coefficient $b_n(\theta)$ in the pattern, so that given the desired 
$b_n(\theta)$'s the distribution $W(s_0, t)$ is obtained from (6) and (4). 
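
The integral relating a harmonic of the distribution to its space factor can be evaluated numerically for a simple case. The sketch below assumes a line source on [-l0, l0] with the free-space kernel G(θ|s) = exp(jks sin θ); that kernel, the function name, and the numerical values are assumptions for illustration, since the paper leaves the Green's function general. For a uniform harmonic distribution the result should reduce to the familiar 2 sin(k l0 sin θ)/(k sin θ).

```python
import cmath
import math


def space_factor(phi_n, theta, k, l0, samples=4000):
    """Midpoint-rule evaluation of the space-factor integral for a line source.

    Assumes the kernel G(theta|s) = exp(j k s sin(theta)) on -l0 <= s <= l0.
    phi_n is a function of position s giving the harmonic distribution.
    """
    ds = 2.0 * l0 / samples
    total = 0j
    for i in range(samples):
        s = -l0 + (i + 0.5) * ds
        total += phi_n(s) * cmath.exp(1j * k * s * math.sin(theta)) * ds
    return total


# Hypothetical example: wavelength 1 (k = 2*pi), half length l0 = 3, uniform phi_n.
k, l0, theta = 2.0 * math.pi, 3.0, 0.3
numeric = space_factor(lambda s: 1.0, theta, k, l0)
analytic = 2.0 * math.sin(k * l0 * math.sin(theta)) / (k * math.sin(theta))

print(f"numeric  = {numeric.real:.6f}")
print(f"analytic = {analytic:.6f}")
```

Going the other way, from a desired space factor to the required distribution, is the synthesis problem and requires inverting this integral for the geometry at hand.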

The foregoing analysis and discussion presents the basic ideas of time 
domain operation. Nothing has yet been said about the data processing 
schemes to realize these objectives. Also, only modulation of the energy 
distribution parameters has been considered. These topics will be discussed in 
later sections. 

APPLICATIONS 

The operation of an antenna system with four-dimensional control will 
undoubtedly find numerous applications to military as well as commercial 
operations. Some of the most interesting characteristics which may be achieved 
by these techniques are: multipattern operation, simultaneous scanning, and 
sidelobe suppression. These applications will be discussed in some detail below. 


Multipattern Operation 

In many instances it would be highly desirable to have a single antenna 
capable of operation in a variety of widely different pattern modes. For 
example, in a ground-based radar, broad search capabilities might be desired 
along with accurate detection and tracking capacity. This capability is readily 
realized through time domain techniques by modulating the antenna in such 
a way as to cause the space factors associated with two harmonic pattern 
frequencies to be a pencil beam and a csc² or fan type pattern respectively. 
These pattern features would exist simultaneously and, by proper processing 
of the complex signal, either mode or both modes of operation can be selected. 

As a simple example of this concept, consider a linear array of length $2l_0$ 
from which a pencil beam and a csc² pattern are desired. Then, in keeping with 
equation (1) and considering only the time-varying pattern, let 

$b_0(\theta) = A_0\, \dfrac{\sin (k l_0 \sin \theta)}{\sin \theta}$ 

and 

$b_1(\theta) = A_1 \csc^2 \theta$. 

The complete time-varying pattern is then 

(7)  $g(\theta, t) = A_0\, \dfrac{\sin (k l_0 \sin \theta)}{\sin \theta} + A_1 \csc^2 \theta \cos \omega_A t$ 

and the antenna modulation necessary to realize this characteristic can be 
found by the procedure leading to equation (6). Although the pattern features 
shown in (7) are to be realized from a single array, it is enlightening to consider 
this array as broken down into two component elements. Figure 1 illustrates 
two linear arrays in symbolic form. One of these arrays is unmodulated and 
has the distribution necessary to give the first term of (7) (a pencil beam). The 
input to the second array is modulated as $\cos \omega_A t$ with a distribution to achieve 
the $\csc^2 \theta$ characteristic. The pattern resulting from the combination is just 
that given by equation (7). These two arrays are then superimposed into an 
equivalent single array and the modulated distribution is achieved by a means 
which is, for the present, unspecified. It should be readily seen that similar 
reasoning could be applied to planar array antennas as well as dish-reflector 
type systems with similar results. 

Fig. 1. Symbolic representation of component arrays for multipattern operation. 


Simultaneous Scanning 

In many military applications it would be extremely useful to have a 
non-scanning antenna which would detect and locate targets over a wide 
angular region. This is again readily available using time domain operation; 
in this case, the space factors $b_n(\theta)$ are pencil beams, each pointed in a 
slightly different direction. Again considering a linear array of length $2l_0$, let 

(8)  $b_n(\theta) = A_n\, \dfrac{\sin \{k l_0 [\sin \theta - \sin \theta_n]\}}{[\sin \theta - \sin \theta_n]}$ 

that is, the pointing direction for each harmonic space factor is different and 
directed at $\theta_n$. 

The time-varying pattern under this condition is 

(9)  $g(\theta, t) = \sum_{n=0}^{N} A_n\, \dfrac{\sin \{k l_0 [\sin \theta - \sin \theta_n]\}}{[\sin \theta - \sin \theta_n]} \cos n\omega_A t$ 

where the $A_n$ are adjusted to make the return from the peak of each space 
pattern equal. The number $N$ and the values $\theta_n$ would be chosen consistent 
with the maximum coverage desired and the necessary discrete angular 
accuracy. Data processing would separate the harmonics and feed the angular 
information to corresponding channels which would be used to drive the 
display apparatus. 
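
The way the harmonic channels encode direction can be illustrated with a small calculation. The sketch below assumes equal amplitudes in every channel, an arbitrary value of k l0, and five beam directions; all of these, and the function names, are hypothetical choices made for illustration. The channel whose beam points nearest the target shows the largest response.

```python
import math


def beam(theta, theta_n, k_l0):
    """Pencil-beam space factor pointed at theta_n, with unit amplitude.

    Evaluates sin(k l0 [sin(theta) - sin(theta_n)]) / [sin(theta) - sin(theta_n)],
    using the limiting value k_l0 on boresight.
    """
    d = math.sin(theta) - math.sin(theta_n)
    if abs(d) < 1e-12:
        return k_l0
    return math.sin(k_l0 * d) / d


def strongest_channel(theta_target, beam_angles, k_l0):
    """Index of the harmonic channel with the largest response to a target."""
    responses = [abs(beam(theta_target, th, k_l0)) for th in beam_angles]
    return max(range(len(responses)), key=responses.__getitem__)


# Hypothetical layout: five beams spaced 10 degrees apart, k*l0 = 20.
beam_angles = [math.radians(a) for a in (-20, -10, 0, 10, 20)]
k_l0 = 20.0

print(strongest_channel(math.radians(9.0), beam_angles, k_l0))
print(strongest_channel(math.radians(-19.0), beam_angles, k_l0))
```

With these assumed values a target at 9 degrees is reported by the channel whose beam points at 10 degrees, and a target at -19 degrees by the channel pointing at -20 degrees.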


Sidelobe Suppression 

Preceding applications of the time domain have utilized modulation of the 
energy distribution function and have considered the use of more than one 
frequency harmonic. For the present application we consider only the time 
average pattern, $b_0(\theta)$, and determine modulation techniques which result in 
the time average having improved pattern characteristics (in particular low 
sidelobes). The utility of this concept is that the extreme mechanical tolerances 
necessary to realize low sidelobes in a non-modulated antenna may be relaxed 
by virtue of the additional degree of freedom provided by the time variation. 

This philosophy may be understood by considering a simple linear array as 
illustrated in Fig. 2. If this array has a uniform distribution, the sidelobe level 
is 13.2 db and it will be shown that considerable reduction in this figure is 
obtained if the array length is modulated in a square wave manner. 

Fig. 2. Co-ordinate system for linear array. 

The pattern for the linear array of Fig. 2 is 

(10)  $g(\theta) = A\, \dfrac{\sin (k l_0 \sin \theta)}{\sin \theta}$ 

and has a sidelobe level of 13.2 db. If it is assumed that the half array length 
is modulated in time as 

(11)  $l = l_0 [1 + \beta f(t)]$ 

where $f(t)$ is the time modulation and $\beta$ is a length parameter, the time-varying 
pattern is 

(12)  $g(\theta, t) = A\, \dfrac{\sin \{k l_0 [1 + \beta f(t)] \sin \theta\}}{\sin \theta}$. 

The time average is obtained by the common integral expression 

(13)  $\langle g(\theta, t) \rangle = b_0(\theta) = \dfrac{1}{T} \int_0^T g(\theta, t)\, dt$ 

where $T$ is the modulation period. When (12) is substituted into (13) and the 
integral evaluated with $f(t)$ a square wave 

$f(t) = \begin{cases} +1, & 0 < t < T/2 \\ -1, & T/2 < t < T \end{cases}$ 

the resulting time average pattern is 

(14)  $b_0(\theta) = A\, \dfrac{\sin \{k l_0 \sin \theta\}}{\sin \theta} \cos (k l_0 \beta \sin \theta)$. 

The multiplicative term $\cos (k l_0 \beta \sin \theta)$ has the effect of an additional pattern 
taper resulting in lowered sidelobes. The parameter $\beta$ is the controlling quantity 
in setting the sidelobe level. Figure 3 displays a power plot of equation (14) for 
judicious choices of this parameter. The optimum condition is seen to correspond 
to a $\beta$ value of 0.25 giving a sidelobe level of approximately 23 db, a 
10-db reduction due to time modulation. It should be emphasized that this 
order of magnitude reduction in sidelobe level is achieved by switching only 
between two differing patterns and with perhaps the simplest form of modulating 
function. 
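
The step from the time-varying pattern to its time average can be checked numerically. For a square wave that spends half the period at +1 and half at -1, the average is simply the mean of the two switched patterns, and the identity sin(a+b) + sin(a-b) = 2 sin a cos b gives the cosine taper. The sketch below takes A = 1; the function names and test values are illustrative assumptions.

```python
import math


def pattern(theta, k_l0, beta, f):
    """Instantaneous pattern with A = 1 for modulation value f (+1 or -1)."""
    return math.sin(k_l0 * (1.0 + beta * f) * math.sin(theta)) / math.sin(theta)


def time_average_square_wave(theta, k_l0, beta):
    """Average over one period of a +/-1 square wave with equal dwell times."""
    return 0.5 * (pattern(theta, k_l0, beta, 1.0) + pattern(theta, k_l0, beta, -1.0))


def closed_form(theta, k_l0, beta):
    """Uniform-array pattern multiplied by the cosine taper, with A = 1."""
    x = k_l0 * math.sin(theta)
    return math.sin(x) / math.sin(theta) * math.cos(beta * x)


# Hypothetical values of k*l0 and beta, checked at a few angles (radians).
k_l0, beta = 20.0, 0.25
for theta in (0.1, 0.4, 0.9, 1.3):
    avg = time_average_square_wave(theta, k_l0, beta)
    ref = closed_form(theta, k_l0, beta)
    print(f"theta={theta:.1f}  averaged={avg:+.6f}  closed form={ref:+.6f}")
```

The two columns agree to rounding error, confirming that switching between two array lengths is equivalent to applying a cosine taper to the uniform-array pattern.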


Fig. 3. Patterns for array with time-varying length (power in db, one way, against $k l_0 \sin \theta$, for $\beta$ = 0.00, 0.25, and 0.50). 

This analysis has definitely established the sidelobe suppression capability 
of time domain techniques; however, the particular example realizes only a 
10-db reduction. It is desirable to obtain a synthesis procedure by which a 
given sidelobe level can be realized. Considering again a uniform linear array, 
the integral form of the time average pattern can be expressed as 

(15)  $\langle g(\theta, t) \rangle = \dfrac{1}{T} \int_0^T \int_{-l(t)}^{l(t)} e^{jkx \sin \theta}\, dx\, dt$ 

and we wish to determine the length variation $l(t)$ so as to realize a desired 
pattern form, $\langle g(\theta, t) \rangle$, with given sidelobe levels. 
Such a procedure has been worked out for the linear array, and applied 
to obtaining a pattern with controllable sidelobes. A simple pattern of this 
type has been given in the literature (Taylor 1952). This pattern is 


(16)  $g(u) = \dfrac{\sin \pi \sqrt{u^2 - B^2}}{\pi \sqrt{u^2 - B^2}}$ 


where $u$ is the normalized space variable and $B$ is the sidelobe controlling 
factor; the first sidelobe voltage ratio being given by 

$4.6 \sinh \pi B / \pi B$. 

The synthesis procedure is based on the analogy of (15) and the expression 
for the principal plane radiation pattern of a two-dimensional aperture. This 
expression is, 

(17)  $g(\theta) = \int_0^h \left\{ \int_{-f(y)}^{f(y)} e^{jkx \sin \theta}\, dx \right\} dy$ 

where $f(y)$ represents the shape of the aperture and $h$ the maximum $y$ dimension. 
Comparison of (15) and (17) indicates the equivalence of $t$ and $y$ through 
the relations 

(18)  $f(y) \sim l(t), \quad h \sim T$. 

Thus, it is clear that time modulation of the length of a linear array is equivalent 
to adding a second dimension of control. Having established the equivalence 
of (18) it is possible to use two-dimensional aperture synthesis methods 
to arrive at the function $f(y)$ (and hence $l(t)$ by (18)) to realize the pattern of 
(16). Figure 4 shows the general form of $l(t)$ plotted on a universal time scale. 
The parameter which controls the sidelobe level in this case (in analogy with 
$B$ of (16)) is the ratio of the period $T$ to the pedestal height denoted by unity 
in the figure. 


Fig. 4. Length variation for Taylor pattern. 


SYSTEM REQUIREMENTS 
The previous section has described a number of applications of the time 
domain to advanced radiating system characteristics. These techniques have 
all been based on the ability of a time-modulated antenna to either increase 
the information-handling capability of the system or to realize improved 
radiation pattern characteristics. In order to capitalize on the increased 
versatility provided by these antenna techniques, due consideration must be given 
to the accompanying system and detection requirements. What modifications 
to present system design procedures are necessary to accommodate 
four-dimensional antennas? How must the modulation frequency be related to 
other signal frequencies? All of these questions are of basic importance in 
the application of time domain antennas to any practical radiating system. 

It should be obvious that the solution to these and other pertinent problems 
could constitute a complete paper in itself. Also, the particular operational 
use (radar, communication, radio astronomy, etc.) under consideration 
is intimately connected with these solutions. As a result it will not be possible 
to cover all the system aspects in detail; instead a few general remarks will 
be given to illustrate possible system solutions. In addition, a detailed analysis 
of four-dimensional antenna operation in a voice communication link will be 
carried out. 

Basically the problem presented to the detection system is one of separating 
a large number of harmonic frequency components and at the same time 
retaining the input signal information. This statement again points up the 
differences which arise because of the various operational uses, the input 
signal nature being different in each case. However, it seems reasonable to 
assume that a normal receiver and detector in conjunction with a bank of 
appropriate filters would be a starting point for any application of these 
techniques. Figure 5 shows a symbolic block diagram of such a system. The 
incoming signal is incident on the antenna, modulated in the proper fashion, 
and then fed to a conventional receiver. From the receiver the information is 
sent to the filters for separation and then to the mode selection circuit 
which feeds the desired combination of signals to the display or control 
apparatus. The harmonic frequency separations and the filter bandpass 
characteristics would be chosen consistent with the signal information to be processed. 
Alternative schemes utilizing coherent detection techniques might also be 
called for in particular applications. It should be re-emphasized that this 
system is not proposed as a solution to a particular operational requirement but 
only as a symbolic illustration of potential system concepts. 

Fig. 5. Block diagram of system for four-dimensional antenna. 

A more detailed analysis of a particular system will prove enlightening. 
Consider a voice communication antenna in which sidelobe suppression is 
artificially realized by modulation techniques. The input angular carrier 
frequency is chosen as $\omega_0$, the voice spectrum extends from 100 c.p.s. to 3000 
c.p.s., and the fundamental modulation frequency is $\omega_A$. Assuming a 
superheterodyne receiver, the carrier frequency will be reduced to an intermediate 
frequency which will be denoted by $\omega_{IF}$. Under these conditions the voltage 
incident on the detector is given by 

(19)  $E(\theta, t) = A\, g(t)\, \{b_0(\theta) + b_1(\theta) \cos \omega_A t + b_2(\theta) \cos 2\omega_A t + \ldots\} \cos \omega_{IF} t$. 

Out of this complex signal we desire only the sidelobe suppressed pattern, 
along with the voice signal represented by $g(t)$. Mathematically that is, 

(20)  $A\, g(t)\, b_0(\theta) \cos \omega_{IF} t$. 


This is essentially a problem of designing the bandwidth of the receiver 
to pass the information of (20) while rejecting all remaining terms of (19). 
Rather than determine the necessary bandwidth, let us fix the bandwidth 
just sufficient to pass the upper voice frequency and determine the lower 
limit on $\omega_A$ to achieve rejection. Under this condition the frequencies 
corresponding to the upper limit of the voice spectrum are $\omega_{IF} \pm 3000$ c.p.s. for dual 
sideband operation and the lower limit of the error frequencies ($\omega_e$) is 
$(\omega_{IF} \pm \omega_A) \mp 3000$ c.p.s. We desire the lowest error frequency to be outside the upper 
voice frequency; mathematically, 

(21)  $\omega_{IF} + \omega_A - 3000$ c.p.s. $> \omega_{IF} + 3000$ c.p.s. 
or 
(22)  $\omega_A > 6000$ c.p.s. 


In other words for a receiver bandwidth of 3000 c.p.s. (SSB) or 6000 c.p.s. 
(double sideband) the lowest possible antenna modulation frequency is 6000 
c.p.s. Thus time domain patterns in this particular case can be detected with 
no modification in the detection system, given due regard to the modulation 
frequency $\omega_A$. Figure 6 is a graphical illustration of the situation just analyzed 
above. 

Fig. 6. Frequency spectrum for time-modulated communications antenna. 
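
The bound on the modulation frequency can be expressed as a one-line rule and checked for any voice bandwidth. In the sketch below, frequencies are in c.p.s. and the function names are hypothetical; the rule simply requires the lowest sideband of the first harmonic to lie above the highest wanted voice sideband.

```python
def min_modulation_frequency(voice_upper_cps):
    """Smallest modulation frequency for which f_IF + f_A - f_v clears f_IF + f_v."""
    return 2.0 * voice_upper_cps


def error_band_clear(f_a_cps, voice_upper_cps):
    """True when the lowest error frequency lies above the upper voice sideband.

    The intermediate frequency cancels from both sides of the inequality,
    so only the modulation and voice frequencies are needed.
    """
    return f_a_cps - voice_upper_cps > voice_upper_cps


print(min_modulation_frequency(3000.0))
print(error_band_clear(6500.0, 3000.0))
print(error_band_clear(5000.0, 3000.0))
```

Because the intermediate frequency drops out, the same limit applies whatever receiver conversion scheme is used.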


PHYSICAL CONFIGURATIONS AND EXPERIMENTAL RESULTS 
Many of the important aspects of four-dimensional radiating systems 
have been discussed earlier; there only remains to suggest possible physical 
antenna configurations capable of realizing the time modulation characteristics 
which might be required in various applications. These configurations will, 
obviously, take quite different forms depending on the type of antenna under 
consideration (diffracting, reflecting, or refracting). Array type antennas are 
a particularly attractive structure because of the capability of point by point 
control over the energy distribution function. This feature is very appropriate 
when modulation characteristics are desired as in the present instance. 
Reflecting and refracting type systems are not as versatile as the array antenna, 
but still lend themselves to modulation capacity. In the case of the reflector, 
the primary feed characteristics may be modulated in time while for the 
refracting structure, the lens material or position may be varied. Thus, 
modulation parameters are available in almost any type of antenna system. 
One of the simplest embodiments of the four-dimensional concept is found 
in the square wave switched linear array. A center-fed slot array is depicted 
in Fig. 7. The end portions of this array are fed through ferrite switches which
are in turn driven by a square wave generator, resulting in an effective
variation in the array length. A system of these arrays could be readily stacked and
synchronously modulated to form a two-dimensional radiating surface with
improved sidelobe characteristics. As has been shown earlier, sidelobe
suppression greater than 10 db can be achieved from linear arrays by considering
modulations other than square wave.

Fig. 7. Square-wave switched linear array.
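The effect of square-wave switching on the time-average pattern can be illustrated numerically. The following is only a sketch under stated assumptions, not the analysis of the earlier sections: it assumes that the time-average (d-c) pattern is the pattern of the time-averaged element excitation, and the array size (16 elements), the number of switched elements (4 per end), and the duty cycle (0.5) are hypothetical values chosen for illustration.

```python
import math

def array_factor(weights, psi):
    """Magnitude of the array factor for real element weights at inter-element phase psi (radians)."""
    n = len(weights)
    re = sum(w * math.cos((i - (n - 1) / 2) * psi) for i, w in enumerate(weights))
    im = sum(w * math.sin((i - (n - 1) / 2) * psi) for i, w in enumerate(weights))
    return math.hypot(re, im)

def time_average_weights(n_total, n_switched_per_end, duty_cycle):
    """Time-averaged excitation when the end elements are switched on for a fraction duty_cycle."""
    w = [1.0] * n_total
    for i in range(n_switched_per_end):
        w[i] = duty_cycle
        w[n_total - 1 - i] = duty_cycle
    return w

def peak_sidelobe_db(weights, samples=4000):
    """Peak sidelobe level in db relative to the main-beam peak."""
    psis = [math.pi * k / samples for k in range(samples + 1)]
    mags = [array_factor(weights, p) for p in psis]
    # Walk down the main lobe until the pattern starts rising again (first null).
    k = 1
    while k < samples and mags[k] <= mags[k - 1]:
        k += 1
    return 20.0 * math.log10(max(mags[k:]) / mags[0])

uniform = [1.0] * 16                          # array permanently at full length
switched = time_average_weights(16, 4, 0.5)   # hypothetical switched array
print(peak_sidelobe_db(uniform), peak_sidelobe_db(switched))
```

In this toy model the switched array behaves, on the time average, like a stepped amplitude taper, which is why its peak sidelobe comes out lower than that of the uniform array.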

The most general and versatile four-dimensional system is one in which 
each element is individually controlled in amplitude and phase by its own 
modulating element. This is exhibited symbolically in Fig. 8 showing a two- 
dimensional array of radiators each fed through a single modulating element. 


Fig. 8. Generalized four-dimensional antenna (radiating elements, each fed through its own modulator).


For low-frequency applications dipole elements would probably be used with 
ferrite or traveling wave tube modulating devices. At higher frequencies slot 
array techniques would be more appropriate. In this case each slot could be 
fed through its own branch line containing the modulation element or very 
recently developed techniques using ferrite excited slots (Shanks 1958) might 
be utilized. This concept of a large number of individually controlled elements 
represents the most refined four-dimensional structure and is capable of 
realizing any of the pattern characteristics mentioned in earlier sections. 

In order to illustrate the range of applicability of time domain operation, 
experimental tests have been made on a readily available modulated antenna. 
This antenna is just the common parabolic dish such as is used in airborne fire 
control systems. The modulation is provided by a nutating primary horn used 
in normal operation to achieve conical lobing. Intuitive considerations indicated
that if the horn were nutating, the time average pattern (d-c) would
display a pencil beam shape while the spatial pattern associated with the 
fundamental frequency component would be characterized by a null in the 
on axis position. Figures 9-11 show the experimental patterns and bear out 
this reasoning. Figure 9 is a reference pattern taken with the horn stationary 
while Fig. 10 is the time average pattern with the feed rotating. The split 
beam pattern shown in Fig. 11 is the pattern obtained at the fundamental 
modulation frequency and is seen to resemble the difference pattern of a
monopulse antenna. In all cases the horn was rotating at 133 c.p.s. The actual
experimental antenna is displayed in Fig. 12 along with the tunable bolometer
amplifier used to filter the proper frequency components. These results are an
excellent example of the multipattern operation described earlier, a sum and
difference type pattern both being generated simultaneously and associated
with different frequency components.

Fig. 9. Reference pattern of conical-lobed antenna (one-way power plot in db versus angle).
Fig. 10. Direct-current response of conical-lobed antenna.
Fig. 11. The 133-c.p.s. response of conical-lobed antenna.
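The intuitive argument for the nutating feed can be checked with a toy model. The sketch below is an assumption-laden illustration, not the measured antenna: the beam is taken to be Gaussian in angle, and the squint angle and beamwidth are hypothetical. It computes the d-c component and the component at the rotation frequency of the power received at a fixed off-axis angle.

```python
import cmath
import math

def received_power(theta, phi, squint, width):
    """Gaussian beam model: power at off-axis angle theta when the squinted beam sits at rotation angle phi."""
    d2 = theta ** 2 + squint ** 2 - 2.0 * theta * squint * math.cos(phi)
    return math.exp(-d2 / width ** 2)

def harmonic(theta, order, squint=1.0, width=3.0, steps=360):
    """Magnitude of the Fourier component of the received power at `order` times the rotation rate."""
    total = 0j
    for k in range(steps):
        phi = 2.0 * math.pi * k / steps
        total += received_power(theta, phi, squint, width) * cmath.exp(-1j * order * phi)
    return abs(total) / steps

# order 0 is the time-average (d-c) pattern; order 1 is the fundamental.
for theta in (-4.0, -2.0, 0.0, 2.0, 4.0):
    print(theta, harmonic(theta, 0), harmonic(theta, 1))
```

On axis the received power does not vary during a rotation, so the fundamental component vanishes while the d-c component is at its maximum, which is the sum and difference behavior described in the text.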





Fig. 12. Conical-lobed antenna and bolometer amplifier.


CONCLUSIONS 

The concepts and techniques discussed in this paper represent one of the 
most interesting and promising new areas of investigation in the field of 
radiating systems. Without a doubt, the course of electromagnetic systems 
in future years will be influenced by this philosophy. However, effort in many 
different directions will be necessary to realize the full capabilities of these 
methods. Not only will extensive theoretical analysis be necessary on such 
questions as the accuracy with which the time modulation must be controlled, 
system analysis, synthesis procedures, etc., but in addition, considerable 
component research and development is indicated. 

The analyses given here have been deliberately chosen for simplicity; 
however, there is every reason to expect refined techniques to reveal similar 
results. It is hoped that this discussion will serve to stimulate antenna research
in these directions.
ON THE THEORY OF FLOW OF UNDERGROUND FLUIDS
IN COMPRESSIBLE STRATA


A. E. SCHEIDEGGER


ABSTRACT 


The problem of flow of a fluid within a compressible porous medium is investigated.
It is shown that in general, the motion of the fluid cannot be separated
from that of the medium. This leads to a very complex problem of consolidation. 
However, considerable simplification can be made in applications to the flow 
of underground fluids. In that case, the general geometry of the consolidation 
can be predicted since the latter can take place in the vertical direction only. 
Furthermore, in many cases it is possible to neglect the volume compressibility 
of the porous matrix. 

Two cases have been considered: that of local isotropy of stress and permea- 
bility and that of local anisotropy of these two quantities. The basic differential 
flow equation for the two cases is deduced. 


1. INTRODUCTION 


Much attention has recently been paid to the flow of fluids through com- 
pressible porous media. Fields in which such media are important include not 
only the study of certain types of petroleum reservoirs but also the investiga- 
tion of ground-water resources and the problem of filtration through com- 
pacting filter cakes. 

If one is to treat the problem of flow through compressible porous media 
fully, one is not able to separate the motion of the fluid from that of the 
bulk of the porous mass. One has thus a twofold problem: the porous medium 
is undergoing finite deformations (governed by corresponding equations of 
motion), and coupled with it, the fluid also is being displaced. All displace- 
ments in question are large so that the whole apparatus of finite strain theory 
has to be invoked. This makes the problem very complicated. Some work 
on this general problem has been done in connection with studies of con- 
solidation of wet soil (Biot 1941, 1956; Florin 1948); the writer is not aware, 
however, of an occasion where the full finite strain mechanism would have 
been introduced and consistently employed. Furthermore, in consolidation 
studies the flow of fluid contained in the pores is only of minor interest, the 
flow velocity is usually eliminated altogether, and one is left with equations 
which refer to the deformation of the porous mass as a whole. 

If the flow of the fluid is of primary interest, one must reintroduce it after 
the problem of consolidation of the medium has been solved, and then solve 
a new set of differential equations. 

In many practical cases, particularly with regard to the study of petroleum 
reservoirs and other fluid-bearing strata within the earth, it is not necessary 
to treat the problem in its full generality. The displacements due to consolidation
are usually small compared with the displacements due to the flow of
fluid. Furthermore, it is possible to predict at least the general geometry in 
which the consolidation will occur so that it is not necessary to solve a complete 
set of differential equations of motion to determine the latter. 

Furthermore, an additional simplification is obtained if it is assumed that the 
change of the matrix volume as compared with that of the pore volume can 
be neglected. In many geological applications, this is possible. 

Therefore, what is required for the practical study of the flow of fluids in 
compressible strata is a set of equations of motion for the fluid wherein due 
allowance is made for the concurrent deformation of the porous medium 
containing the fluid. There are two possibilities that come to one’s mind: 
either the permeability and the stresses are locally isotropic, or they are not 
isotropic. One would think, offhand, that it might be possible to treat the 
porous medium as locally isotropic with regard to permeability and stresses, 
but recent investigations by Hubbert (1957) show that this is not the case. 
The possibility of anisotropy occurring must therefore be investigated. 


2. STRESSES IN POROUS MEDIA 

If porous media are treated as deformable bodies, one is first of all faced 
with the problem of describing the deformations sustained as functions of 
the prevailing stresses. The total stress in a porous medium (Terzaghi 1951)
can be split into two components: one component, termed the neutral stress, 
is the stress in the fluid; the second component, called the effective stress, is 
the difference between the total stress prevailing in the fluid-filled porous 
medium and the neutral stress. It is this stress which produces the deformation 
of the porous medium. 

In the case that the total stress in the porous medium can be assumed as 
hydrostatic, this total stress in the porous medium is called overburden pressure. 
The stress state in the fluid-filled porous medium is then completely described 
by giving any two of the following three quantities: overburden pressure
($\pi$), effective pressure ($\tau$), neutral (fluid) pressure ($p$). The relation between
these three quantities is


(2.1)    $\pi = \tau(1 - P) + pP$


where P is the porosity. The last equation is, in fact, a definition of the 
“effective” pressure; the latter also can be (and has been) defined differently. 
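Since eq. (2.1) serves as the definition of the effective pressure, it can be solved for that quantity once overburden pressure, fluid pressure, and porosity are given. A minimal sketch, reading (2.1) as overburden = effective × (1 − P) + fluid × P; the function names and the numerical values are illustrative assumptions only.

```python
def overburden_pressure(effective, fluid_pressure, porosity):
    """Eq. (2.1) evaluated directly: pi = tau*(1 - P) + p*P."""
    return effective * (1.0 - porosity) + fluid_pressure * porosity

def effective_pressure(overburden, fluid_pressure, porosity):
    """Eq. (2.1) solved for the effective pressure tau."""
    if not 0.0 <= porosity < 1.0:
        raise ValueError("porosity must lie in [0, 1)")
    return (overburden - fluid_pressure * porosity) / (1.0 - porosity)

# Hypothetical values in arbitrary but consistent pressure units.
tau = effective_pressure(overburden=500.0, fluid_pressure=200.0, porosity=0.2)
print(tau)
```

Any two of the three pressures fix the third, which is the sense in which the text says the hydrostatic stress state is described by two quantities.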

It is clear from the above remarks that, in order to describe the stress state 
in a fluid-filled porous medium, one needs all in all seven parameters: the six 
parameters of the (symmetric) total stress tensor $\tau_{ik}$ and the fluid pressure $p$.
If we have applications to underground strata in mind, it will be possible in 
most cases to regard the vertical direction as a principal stress direction; the 
reasons for this are outlined, e.g. in Anderson's (1942) book. The other two 
principal directions, then, lie somewhere in the horizontal plane. 

Using a Cartesian co-ordinate system $x$, $y$, $z$ with $z$ denoting the vertical
and $x$, $y$ principal axes, one has, under the present simplifying assumptions,
three significant stress components: $\tau_{xx}$, $\tau_{yy}$, and $\tau_{zz}$; the other stress components,
viz. $\tau_{xy}$, $\tau_{xz}$, and $\tau_{yz}$, are zero. The seven parameters, therefore, reduce
to four (viz. $\tau_{xx}$, $\tau_{yy}$, $\tau_{zz}$, and $p$). A further simplification is achieved if one
assumes the stress state as isotropic; one then has only two significant para- 
meters (overburden pressure and fluid pressure). 

The next task is to describe the deformation of the porous medium during 
compaction by some proper means. To this end, we follow the general method 
in finite strain theory (cf. Scheidegger 1956) of introducing “parameters”
$\xi_\alpha$. These parameters are the Cartesian co-ordinates of the porous medium in
an arbitrary state (e.g. under zero overburden stress and zero fluid pressure).
The parameters $\xi_\alpha$ will be the variables to which everything will ultimately
be referred.

Let the co-ordinates of the point $\xi_\alpha$ in the stressed state be $x_i(\xi_\alpha)$. Then
we have

(2.2)    $x_i = x_i(\xi_\alpha, \tau_{ik}, p, P)$;


(2.3)    $\xi_\alpha = \xi_\alpha(x_i, \tau_{ik}, p, P)$


where it is implied that the rheological condition of the porous medium is
such that the latter's deformation is entirely given if a certain value is assumed
for the total stress $\tau_{ik}$, the fluid pressure $p$, and the fluid content (volume
porosity $P$). This, of course, neglects all the inertia forces connected with
the motion of the medium. However, we allow for the possibility of hysteresis 
by specifying single valuedness for the x’s only with regard to a certain 
direction from which the stress state in question was attained. 

With the above equipment, one is able to proceed to the hydrodynamics 
of the fluid contained in the pores. 


3. THE HYDRODYNAMIC EQUATIONS 

We now proceed to set up the hydrodynamic equations of the fluids. The 
fluid will be connected with the porous medium during the latter’s consoli- 
dation. What is of ultimate interest is only the motion of the fluid in terms 
of the parameters $\xi_\alpha$. As is well known (cf. the writer’s book (Scheidegger
1957)), one needs three sets of conditions to make any flow problem deter- 
mined: (i) Darcy’s law, (ii) the constitutive equation of the fluid, and (iii) 
the continuity condition. The flow equations should be set up in such a
fashion that the various quantities occurring therein can be conveniently
measured.

Beginning with Darcy's law, we assume that it can be written as follows

(3.1)    $q_\alpha = -\dfrac{k_{\alpha\beta}}{\mu}\,\dfrac{\partial p}{\partial \xi_\beta}$


where the summation convention is used. The quantity $q_\alpha$ is the filter velocity
vector as referred to the filtration surface in the zero strain state, $\mu$ the
viscosity, and $p$ the pressure of the liquid. The quantity $k_{\alpha\beta}$ is the (variable)
permeability tensor; there is some ambiguity as to whether or not it would
have to be taken inside the gradient. This question, however, also occurs in
the discussion of relative permeability, and an explanation of why the form
given above should be adopted over other possibilities may be found on p. 62
of the writer’s book (Scheidegger 1957).

The constitutive equation of the fluid is simply
(3.2)    $\rho = \rho(p)$
where $\rho$ is the density, $p$ the pressure.

This is the same constitutive equation as that encountered in flow through 
incompressible porous media. 

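Finally, we turn our attention to the continuity condition. Expressing the
condition that the mass flow out of a small volume $d^3\xi$ of porous medium
(with vector-surface elements $dA_\alpha$) must be equal to the change of mass
inside yields


(3.3)    $\oint \rho q_\alpha\, dA_\alpha = -\dfrac{\partial}{\partial t}\int (1+\kappa)\rho P\, d^3\xi$


where $P$ is the porosity at the point $\xi_\alpha$, and $(1+\kappa)$ is the volume factor induced
by the consolidation of the porous medium. In virtue of Gauss’s theorem,
the last equation can be written as follows:


(3.4)    $\int \dfrac{\partial}{\partial \xi_\alpha}(\rho q_\alpha)\, d^3\xi = -\int \dfrac{\partial}{\partial t}\left[(1+\kappa)\rho P\right] d^3\xi$

or

(3.5)    $-\dfrac{\partial}{\partial \xi_\alpha}(\rho q_\alpha) = \dfrac{\partial}{\partial t}\left[(1+\kappa)\rho P\right]$.


This is the continuity condition.

Combining the three basic equations, one ends up with an equation for
$p$ only:


(3.6)    $\dfrac{\partial}{\partial t}\left[(1+\kappa)\rho P\right] = \dfrac{\partial}{\partial \xi_\alpha}\left[\dfrac{\rho}{\mu}\,k_{\alpha\beta}\,\dfrac{\partial p}{\partial \xi_\beta}\right]$.


This is the fundamental hydrodynamic equation.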

Equation (3.6) can be further simplified if it can be assumed that the total 
change of bulk volume in any one specimen is entirely produced by change 
of the pore volume, i.e. if it can be assumed that the volume of the solid 
matrix changes only negligibly. Then, one can write 


(3.7)    bulk volume $(p, \pi)$ = bulk volume$_0$ $(1+\kappa(p, \pi))$,


(3.8)    pore volume$_0$ = $P_0$ bulk volume$_0$,


(3.9)    $P = \dfrac{\text{pore volume}\,(\pi, p)}{\text{bulk volume}\,(\pi, p)} = \dfrac{\text{pore volume}\,(\pi, p)}{\text{bulk volume}_0\,(1+\kappa(p, \pi))}$

but:

(3.10)    pore volume $(\pi, p)$ − pore volume $(0)$
              = bulk volume $(\pi, p)$ − bulk volume $(0)$
              = bulk volume$_0$ $[(1+\kappa(p, \pi)) - 1]$ = $\kappa(p, \pi)$ bulk volume$_0$;

furthermore:

(3.11)    pore volume $(\pi, p)$ = bulk volume$_0$ $\kappa(p, \pi)$ + pore volume$_0$,
(3.12)    pore volume $(\pi, p)$ = bulk volume$_0$ $(P_0+\kappa(p, \pi))$;

hence:

(3.13)    $P$ = [bulk volume$_0$ $(P_0+\kappa(p, \pi))$]/[bulk volume$_0$ $(1+\kappa(p, \pi))$],
or

(3.14)    $P = [P_0+\kappa(p, \pi)]/[1+\kappa(p, \pi)]$.
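Relation (3.14), read here as P = (P0 + κ)/(1 + κ), is easily checked numerically. The sketch below assumes that reading and uses hypothetical values for the zero strain porosity and the relative bulk-volume change; it also verifies that (1 + κ)P equals P0 + κ, which is the substitution that simplifies the fundamental hydrodynamic equation.

```python
def porosity(p0, kappa):
    """Eq. (3.14): porosity when the bulk-volume change is entirely a pore-volume change."""
    if kappa <= -1.0:
        raise ValueError("kappa must be greater than -1")
    return (p0 + kappa) / (1.0 + kappa)

# Hypothetical sample: 25 per cent zero strain porosity, 5 per cent bulk-volume reduction.
p0, kappa = 0.25, -0.05
P = porosity(p0, kappa)
print(P, (1.0 + kappa) * P, p0 + kappa)
```

With these numbers the compaction lowers the porosity from 0.25 to about 0.21, and the last two printed values agree.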


Therefore, in the simplified case at present under consideration, the fundamental
hydrodynamic equation reduces to

(3.15)    $\dfrac{\partial}{\partial t}\left[\rho\,(P_0+\kappa(p, \pi))\right] = \dfrac{\partial}{\partial \xi_\alpha}\left[\dfrac{\rho}{\mu}\,k_{\alpha\beta}\,\dfrac{\partial p}{\partial \xi_\beta}\right]$.


4. THE CASE OF LOCALLY ISOTROPIC STRESS AND PERMEABILITY

Let us consider first the case of locally isotropic stress and permeability. 
In that case, according to earlier remarks, the stress state is described by 
two parameters, viz. by the fluid pressure $p$ and by the overburden pressure
$\pi$. The various parameters required in the flow equation (3.6) can be determined
as follows:


(a) Permeability 

Let a unit (at zero strain) volume be enclosed in a pressure cell. The parameters
$\xi_\alpha$ then correspond to this state. Then, the pressure is raised to $\pi$, $p$
and one measures a certain flow. Let the volume reduction be by the relative
amount $-\kappa(\pi, p)$, i.e. the length reduction by $-\tfrac{1}{3}\kappa(\pi, p)$ (neglecting squares
of $\kappa$). What is measured is a superficial permeability $k_{sup}$, according to the
formula


(4.1)    $\dfrac{Q}{A_0} = \dfrac{k_{sup}}{\mu}\,\dfrac{p_2 - p_1}{L_0}$

where the index “0” stands to indicate that the original (zero strain) area
and length are used to calculate $k_{sup}$. The true permeability of the sample,
assuming cubic volume change, is given by


(4.2)    $\dfrac{Q}{A_0(1+2\kappa/3)} = \dfrac{k_{true}}{\mu}\,\dfrac{p_2 - p_1}{L_0(1+\kappa/3)}$

hence

(4.3)    $k_{true} = \dfrac{1+\kappa/3}{1+2\kappa/3}\,k_{sup} = \dfrac{3+\kappa}{3+2\kappa}\,k_{sup} = \left(1 - \dfrac{\kappa}{3}\right)k_{sup}$


where the last equal sign has been obtained upon neglecting high order terms.

However, the volume change in the field is not cubic, although it is cubic
in the pressure cell. According to earlier remarks, the compaction occurs in
the vertical ($\zeta$) direction only. Assuming the true permeability under pressure
to be the same in the field as in the pressure cell, we can express the condition
for the “superficial” flow $q_{sup}$, i.e. the flow in terms of zero strain
co-ordinates of the field. We have


(4.4)    $q_{sup\,\xi} = \dfrac{Q_\xi(\text{true})}{A_0} = \dfrac{k_{true}}{\mu}\,\dfrac{p_2 - p_1}{L_0}\,(1+\kappa)$

and hence

(4.5)    $k_{\xi\xi} = k_{\eta\eta} = (1+\kappa)\,k_{true} = (1+\kappa)\,\dfrac{1+\kappa/3}{1+2\kappa/3}\,k_{sup}$.

Neglecting squares of $\kappa$:

(4.6)    $k_{\xi\xi} = \dfrac{3+4\kappa}{3+2\kappa}\,k_{sup} = \left(1 + \dfrac{2\kappa}{3}\right)k_{sup}$.

Similarly

(4.7)    $q_{sup\,\zeta} = \dfrac{Q_\zeta(\text{true})}{A_0} = \dfrac{k_{true}}{\mu}\,\dfrac{p_2 - p_1}{L_0(1+\kappa)}$


and hence


(4.8)    $k_{\zeta\zeta} = \dfrac{1}{1+\kappa}\,k_{true} = \dfrac{1}{1+\kappa}\,\dfrac{1+\kappa/3}{1+2\kappa/3}\,k_{sup}$.

Neglecting squares of $\kappa$:

(4.9)    $k_{\zeta\zeta} = \dfrac{1+\kappa/3}{1+5\kappa/3}\,k_{sup} = \dfrac{3+\kappa}{3+5\kappa}\,k_{sup} = \left(1 - \dfrac{4\kappa}{3}\right)k_{sup}$.


The permeability tensor in the system $\xi$, $\eta$, $\zeta$ ($\zeta$ vertical) has therefore
the following form


(4.10)    $k_{\alpha\beta} = k_{sup}\begin{pmatrix} 1+\tfrac{2}{3}\kappa & 0 & 0 \\ 0 & 1+\tfrac{2}{3}\kappa & 0 \\ 0 & 0 & 1-\tfrac{4}{3}\kappa \end{pmatrix}$

It is therefore seen that, in order to determine $k_{\alpha\beta}$ one needs one permeability
measurement ($k_{sup}$) and one measurement of the bulk compressibility for
each possible pair of variables $p$, $\pi$. During the depletion of underground
strata of their fluid content, it can be assumed that the overburden pressure
stays constant; the measurements of $k_{sup}$ and $\kappa$ have therefore to be made
with given overburden pressure $\pi$ for all values of $p$.


(b) Porosity 

Another parameter that enters into eq. (3.6) is the combination $(1+\kappa)\rho P$.
It is customary to measure the differential quotient $(1/P_0)(d/dp)[(1+\kappa)P]$
where the index 0 refers to the zero strain state. This differential quotient
has been called “effective compressibility” (McLatchie, Hemstock, and
Young 1958). If the latter be measured for one series of values $\pi$, $p$ arriving
at the field overburden pressure and fluid pressure from the zero strain state,
and then for the given overburden pressure for all pertinent values of the
fluid pressure $p$, and furthermore, if the rheological equation of the fluid $\rho(p)$
be known, then the required values of $(1+\kappa)\rho P$ can be obtained by integration.
It is then possible to eliminate all unknowns except $p$ from (3.6) and
hence, to solve any underground flow problem.
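The integration mentioned above can be sketched as follows. This is a simplified illustration, not a prescribed procedure: it assumes a single measurement path along which the effective compressibility is tabulated against fluid pressure, takes the first tabulated point as the reference state where (1 + κ)P equals P0, and uses the trapezoidal rule. The density law and all numerical values are hypothetical.

```python
import math

def storage_term(pressures, c_eff, p0_porosity, density):
    """Return (1 + kappa) * rho * P at each tabulated pressure.

    c_eff[i] is the effective compressibility (1/P0) d[(1 + kappa) P]/dp at
    pressures[i]; pressures[0] is the reference state where (1 + kappa) P = P0.
    """
    pore_term = [p0_porosity]
    for i in range(1, len(pressures)):
        dp = pressures[i] - pressures[i - 1]
        mean_c = 0.5 * (c_eff[i] + c_eff[i - 1])   # trapezoidal rule
        pore_term.append(pore_term[-1] + p0_porosity * mean_c * dp)
    return [density(p) * s for p, s in zip(pressures, pore_term)]

# Hypothetical slightly compressible liquid and constant effective compressibility.
rho = lambda p: 1000.0 * math.exp(4.5e-10 * p)
p_table = [0.0, 1.0e6, 2.0e6, 3.0e6]
c_table = [2.0e-9] * len(p_table)
print(storage_term(p_table, c_table, 0.2, rho))
```

The resulting table of $(1+\kappa)\rho P$ against $p$ is what enters the left-hand side of (3.6).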


5. ANISOTROPIC STRESS AND PERMEABILITY 

It is interesting to note the modifications that are required in the above 
scheme if one is to generalize it to take care of anisotropic stresses and per- 
meability. The general equation of flow (3.6) is, of course, still applicable, 
but different means have to be found to measure the required quantities. 

In order to make the problem amenable to investigation at all, it is necessary 
to assume that the vertical direction is a principal direction for stress and 
for permeability. The other principal axes then lie somewhere in the horizontal 
plane. 

In order to measure the quantities required in equation (3.6), a means 
must be found to restore in the laboratory the state in which the rock was 
in the field. In general, this requires a separation of the three principal 
stresses from one another. Thus, a piece of rock being taken from a well 
will, first of all, expand cubically (because it is taken out of confinement) 
while it is in transit to the laboratory. Then it must be enclosed in some 
device in which the total stresses $\tau_{xx}$, $\tau_{yy}$, and $\tau_{zz}$, as well as the fluid pressure
$p$ can be brought up to the values which prevailed in the field. This reproduces
the original state of the rock. For the following, it is useful to refer the parameters
$\xi_\alpha$ to this state rather than to the zero strain state.

Afterward, during a test run, $\tau_{zz}$ must be held constant, the lateral stresses
($\tau_{xx}$, $\tau_{yy}$) must be adjusted in such a manner that there is no lateral motion
of the rock (which is the boundary condition prevailing in the field) and
the vertical ($k_z$) and horizontal ($k_x$, $k_y$) permeabilities must be measured
separately for all values of the fluid pressure $p$. This yields immediately the
permeability tensor $k_{\alpha\beta}$, viz.


(5.1)    $k_{\alpha\beta} = \begin{pmatrix} k_x & 0 & 0 \\ 0 & k_y & 0 \\ 0 & 0 & k_z \end{pmatrix}$


if $\xi_\alpha$, the lengths, and the areas are referred to the original field state as
defined above (rather than the zero strain state). The other quantity that is
required in the flow equation is $[(1+\kappa)\rho P]$; it can be measured just as in the
isotropic case. Since the $\xi_\alpha$ now refer to the field state, the effective compressibility
must be measured with this field state as basic state ($P_0$ referring
to this state). It may be noted that it is now no longer necessary to obtain the
bulk compressibility and the “effective” (pore) compressibility separately,
because an additional permeability measurement has been substituted for same.

Again, with the various quantities measured, all unknowns except $p$ can
be eliminated in the fundamental flow equation (3.6) and hence it is possible
to solve any underground flow problem.

6. DISCUSSION 

The preceding analysis enables one to draw some interesting conclusions 
with regard to practical requirements in petroleum engineering and in hydro- 
logical studies. These are: 

(i) The problem of flow of fluids through consolidating porous media is 
quite complicated in its full generality, but for practical application to the 
study of the flow of underground fluids, substantial simplifications are per- 
missible. One thus can assume that the compaction takes place in the vertical 
direction only. 

(ii) If it is further assumed that the stress state in the porous medium is
hydrostatic, and the permeability locally isotropic, then the basic differential
equation for the flow contains three independent quantities (in addition to
the pressure-density relationship of the fluid) that have to be measured as
functions of the pressure and overburden pressure. These are (for instance)
the bulk-volume reduction $\kappa$ of the medium, the superficial permeability $k_{sup}$,
and the effective compressibility $(1/P_0)\,d[(1+\kappa)P]/dp$, the latter being defined
as the change of pore volume per unit (zero strain) pore volume contained
in the medium per unit (fluid) pressure change. Furthermore, the pressure-density
relationship for the fluid must be known. It may be noted that the
effective compressibility is not required for the solution of steady-state
problems.

(iii) If it is conceded that the permeability as well as the stress may be 
different in the horizontal direction from that in the vertical direction, then 
one can note the following facts: 

(a) The variables are now four instead of two: the fluid pressure $p$ (just
as before) and three values of the overburden stress $\tau_{xx}$, $\tau_{yy}$, and $\tau_{zz}$ (instead
of the overburden pressure $\pi$).

(b) There are now seven quantities instead of three quantities that have
to be measured: (i) the effective compressibility just as before; (ii) three
permeability values (in the vertical and two principal horizontal directions)
instead of one as before, and (iii) three quantities (change of length in the
vertical and in the two principal horizontal directions, respectively) which
are taking the place of the bulk compressibility. If it is again assumed that
compaction in the field can occur in the vertical direction only, then the
last three quantities reduce to one (to the compaction in the vertical direction),
which can be omitted entirely if flow problems only are of interest as distinct
from the compaction of the medium, but the stress states that are permissible
are then only those which do not produce any lateral motion.

With the required quantities known as functions of pressure and overburden
stress, it is in principle possible to solve any underground flow problem.

It remains to compare the above investigations with other recent attempts
at elucidating the effect of fluid pressure decline on the volume changes of
porous rocks. Geertsma (1957) noted that two different
compressibilities have to be measured to describe properly the volume changes 
of a porous medium under compression. However, Geertsma does not proceed 
to write down the equations of motion. He therefore introduces ‘‘viscous’’ 
deformations of the medium. It is the contention of the writer that these 
‘viscous’ deformations are the automatic outcome of the general theory of 
consolidation. 

Similarly, Barenblatt and Krylov (1955) wrote down an equation of flow 
in fluids in deformable porous media. In essence, their equation is expressed 
in terms of partial derivatives of the porosity with regard to the fluid pressure 
(the mean effective pressure being held constant) and with regard to the 
mean effective pressure (the fluid pressure being held constant). This, of 
course, yields an equivalent expression to that given in this paper, except 
that the choice of variables makes it rather difficult to adapt the formulas
to practical cases.
AN EXPERIMENTAL STUDY OF INTERCEPTION NOISE IN
ELECTRON STREAMS AT MICROWAVE FREQUENCIES


B. A. McINTOSH


ABSTRACT 


The noise added to an electron stream by the interception of a fraction of the 
current has been studied experimentally at a frequency of 3 kMc/s. The electron 
beam was produced in a demountable vacuum system by a parallel-flow Pierce 
gun in a confining magnetic field. A series of circular apertures and mesh grids 
on a plate capable of being moved within the vacuum chamber allowed the 
interception of various fractions of the total beam current. The excess noise 
caused by interception was measured at the anode of the electron gun and at 
various points in a drift region. 

Interception noise caused by mesh grids was found to be of much greater mag- 
nitude than that caused by circular apertures. The absolute level of the excess 
noise and its variation with the strength of the confining magnetic field were 
found to be in reasonable agreement with existing theory as modified by the 
author. 

It was shown that current interception excites a standing wave of noise along
the electron beam in the drift region. Measurements made on a temperature-limited
electron beam have shown that interception noise can arise in a region
where noise smoothing is a consequence of processes external to the electron
gun.


I. INTRODUCTION 


It has long been known that if a portion of the electron current in a vacuum 
tube is intercepted by a structural element, the inherent noise in the remaining 
portion of the electron stream is increased. Theoretical predictions of the 
increased noise in multigrid tubes as opposed to triodes were verified experimentally
(North et al. 1940). These measurements were made on common
radio receiving tubes at low frequencies. 

As a result of the recent development of beam-type amplifier tubes for 
use at microwave frequencies, the role of interception noise in determining 
the noise properties of such tubes is of interest. Excess noise caused by current 
interception has been observed experimentally in microwave tubes (Cutler 
and Quate 1950; Fried and Smullin 1954). These workers noted that the 
interception of a cylindrical beam by a circular aperture caused much less 
excess noise than would be predicted by North’s theory, but comprehensive 
experimental measurements have been lacking. North’s theory was thought 
to be inapplicable to microwave tubes since it was based on a particular 
mechanism of noise smoothing which was believed to be invalid at high 
frequencies. North also assumed that current was intercepted by a fine mesh 
grid. 

A significant contribution to the theory of interception noise has been 
made by Beam (1955). He calculated the excess noise caused by interception 
of an electron beam by a circular aperture for the case of a beam collimated
by an axial magnetic field. Since most microwave tubes require a magnetic 
field to confine the long electron beams used, such theory has considerable 
practical importance. Beam presented experimental measurements to verify 
the functional variation of interception noise with magnetic field strength 
but made no attempt to verify predictions concerning the magnitude of the 
excess noise. 

It is well known that when an electron beam drifts at constant velocity 
in a field-free region a standing wave of noise exists on the beam. Robinson 
(1954) postulated that interception would excite a noise standing wave but 
experimental verification of this phenomenon has not been reported in the 
literature. A knowledge of the behavior of the interception-noise wave in 
the drift space is of importance in minimizing the noise figure of beam-type 
tubes such as the travelling-wave tube. 

It has been pointed out by Beam (1955) that a theory of interception 
noise based on random probability of interception of electrons leads to the 
same result as North’s theory yet requires no assumptions concerning the 
mechanism of smoothing as did North’s theory. Experimental verification 
was again lacking. 


II. THEORY 


Although this research has been primarily of an experimental nature, some 
attempt has been made to clarify and extend existing theories. North’s theory 
of interception noise (North et al. 1940) is based on his conception of the 
smoothing action of the potential minimum which lies immediately in front 
of the electron-gun cathode when the emission is space-charge limited. Fluctuations
in the total emission current were believed to be reduced in magnitude
by “compensating pulses” of current produced by the potential minimum.
Interception of a fraction of the current by a mesh grid destroys the correlation
between the initial fluctuations and the compensating fluctuations and thereby
causes the noise to be increased. The validity of this theory was therefore
in doubt when it was found that the potential minimum played little part in 
the noise-smoothing process at microwave frequencies. 

Robinson and Kompfner (1951) have given a simplified derivation of 
North’s result, which may be written in the form 


(1)    $\overline{i_{n2}^2} = \Gamma_1^2\,k_0\,2eI_{01}\Delta f + (1-\Gamma_1^2)(1-k_0)\,k_0\,2eI_{01}\Delta f$


where $\overline{i_{n2}^2}$ is the mean square of the noise-current fluctuations that exist in
the beam after it has been intercepted;

$e$ is the electron charge;

$I_{01}$ is the direct current before interception;

$k_0$ is the fraction of the current $I_{01}$ which is transmitted through the
intercepting grid;

$\Delta f$ is the frequency bandwidth in which the noise is observed;

$\Gamma_1^2$ is called the “smoothing factor” of the electron beam before
interception.
The smoothing factor may be defined as the ratio of the actual noise power
in the beam to full shot-noise power in the same direct current; that is,


$\Gamma_1^2 = \overline{i_1^2} / (2eI_{01}\Delta f)$.


(The subscript 1 is used to denote quantities before interception; 2 for quantities
after interception.) The smoothing factor of the beam after interception may
be written


(2)    $\Gamma_2^2 = \overline{i_2^2} / (2eI_{02}\Delta f) = \Gamma_1^2 + (1-\Gamma_1^2)(1-k_0)$.
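
As a numerical illustration of the fine-mesh result, the following minimal sketch evaluates $\Gamma_2^2 = \Gamma_1^2 + (1-\Gamma_1^2)(1-k_0)$ and expresses the smoothing in decibels. The function names and the sample values (10 db of initial smoothing, 50% transmission) are hypothetical and chosen only for illustration; they are not measured values from this work.

```python
import math


def smoothing_after_interception(gamma1_sq: float, k0: float) -> float:
    """North's fine-mesh result: Gamma_2^2 = Gamma_1^2 + (1 - Gamma_1^2)(1 - k0)."""
    if not 0.0 < k0 <= 1.0:
        raise ValueError("k0 must lie in (0, 1]")
    return gamma1_sq + (1.0 - gamma1_sq) * (1.0 - k0)


def to_db(ratio: float) -> float:
    """Express a power ratio in decibels."""
    return 10.0 * math.log10(ratio)


# Hypothetical example: a beam smoothed to 10 db below shot noise,
# intercepted by a fine grid that transmits half the current.
gamma1_sq = 0.1
k0 = 0.5
gamma2_sq = smoothing_after_interception(gamma1_sq, k0)
print(f"before interception: {to_db(gamma1_sq):.1f} db relative to shot noise")
print(f"after interception:  {to_db(gamma2_sq):.1f} db relative to shot noise")
```

With no interception ($k_0 = 1$) the function returns $\Gamma_1^2$ unchanged, and as $k_0$ approaches zero the transmitted current approaches full shot noise, as the equation requires.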


It has been demonstrated by Beam (1955) that the theory of interception 
noise can be based solely on the assumption of random interception of electrons 
without recourse to any assumptions concerning the nature of the smoothing 
process. North’s result is in fact applicable to high-frequency tubes provided 
the intercepting electrode is a fine mesh grid, which is to say that the proba- 
bility of interception of electrons must be the same at all points on a cross 
section of the electron stream. The author has shown (McIntosh 1958) that 
a criterion for the necessary ‘‘fineness’’ of mesh may be set up in terms of 
the concept of ‘‘mean spreading radius” of electrons. This is essentially the 
radial distance about any point on a cross section of the electron stream, 
over which the probability of random interception is uniform; it is determined 
by the distribution of random transverse velocities of the electrons. If the 
dimensions of the grid openings are larger than the mean spreading radius, 
as in the case of a single circular aperture, the effective area over which 
interception of electrons will be random is much smaller than the total beam 
area. The excess noise added to the beam is therefore much smaller than the 
North value. 

The probability that an electron at radius $r$ in the beam be transmitted
through an aperture of radius $r_a$ has the form sketched in Fig. 1(a). The
region in which transmission or interception may be random is essentially
determined by the product $k_0(r)[1 - k_0(r)]$ shown in Fig. 1(b). If these factors
are averaged over the total beam area they yield respectively the fraction
of current transmitted and the fraction of the current in which interception
is random. Following Beam’s notation (Beam 1955), these are


(3)    $Q_1 = \dfrac{\int J(r)\,k_0(r)\,dA}{\int J(r)\,dA}$


(4)    $Q_2 = \dfrac{\int J(r)\,k_0(r)[1-k_0(r)]\,dA}{\int J(r)\,dA}$.


J(r) is the direct current density which in general may be a function of trans- 
verse co-ordinates. The integration is over the entire beam area. 
The smoothing factor after interception may then be written 


(5)    $\Gamma_2^2 = \Gamma_1^2 + (1-\Gamma_1^2)\,\dfrac{Q_2}{Q_1}$.
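
The averages $Q_1$ and $Q_2$ can be evaluated numerically for any assumed transmission probability $k_0(r)$ and current-density profile $J(r)$. The sketch below is a minimal illustration using a midpoint rule over a circular cross section ($dA = 2\pi r\,dr$); the function names and the test profiles are hypothetical and are not taken from the original calculations. For a uniform transmission probability it recovers the fine-mesh limit $Q_2/Q_1 = 1 - k_0$.

```python
import math
from typing import Callable, Tuple


def q_factors(k0: Callable[[float], float],
              J: Callable[[float], float],
              r_max: float,
              n: int = 20000) -> Tuple[float, float]:
    """Midpoint-rule estimate of Q1 and Q2 over a disc of radius r_max."""
    dr = r_max / n
    num1 = num2 = den = 0.0
    for i in range(n):
        r = (i + 0.5) * dr
        weight = J(r) * 2.0 * math.pi * r * dr   # J(r) dA for an annulus
        k = k0(r)
        num1 += weight * k                        # transmitted current
        num2 += weight * k * (1.0 - k)            # randomly intercepted part
        den += weight                             # total current
    return num1 / den, num2 / den


def smoothing_from_q(gamma1_sq: float, q1: float, q2: float) -> float:
    """Smoothing factor after interception: Gamma_1^2 + (1 - Gamma_1^2) Q2/Q1."""
    return gamma1_sq + (1.0 - gamma1_sq) * q2 / q1


# Hypothetical check: uniform beam, uniform transmission probability 0.6
# (the fine-mesh case), so Q2/Q1 should equal 1 - 0.6.
q1, q2 = q_factors(lambda r: 0.6, lambda r: 1.0, r_max=1.0)
print(q1, q2, q2 / q1)
print(smoothing_from_q(0.1, q1, q2))
```

A sharp-edged aperture ($k_0$ equal to one inside $r_a$ and zero outside) gives $Q_2 = 0$ in this scheme, which corresponds to an aperture adding no interception noise when the mean spreading radius is negligible.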

Fig. 1. (a) The probability that an electron associated with a point at distance $r$ from the
center of the beam will be transmitted through an aperture of radius $r_a$.
(b) A plot of the region, near the aperture edge, in which interception or trans-
mission of electrons is random.


Evaluation of $Q_1$ and $Q_2$

To simplify calculation, it may be assumed that the current density $J(r)$
is constant out to the beam edge at $r = r_b$ and is zero thereafter. Then for
all practical purposes,


(6)    $Q_1 = \left(\dfrac{r_a}{r_b}\right)^2, \quad r_a \le r_b$.


In cases where $r_a$ is nearly equal to $r_b$, $Q_1$ is approximately unity but $1-Q_1$ is
of significance and may be calculated from the equations given by Beam.

The author has shown (McIntosh 1958) that an approximate expression 
for Q2 is 


(7) Q, = ar,/8ar,{1+erf[2(r)a(rm—r,)]} 


where ‘‘erf’’ stands for “error function of’. The parameter a is the inverse 
of the mean spreading radius. 

When the electron beam is confined by a magnetic field, transverse thermal 
velocities cause the electrons to execute spiral trajectories. The Maxwellian 
distribution of spiral radii determines the mean spreading radius as 


(8)    $\dfrac{1}{a} = \dfrac{(mkT_c)^{1/2}}{Be}$


where $m$ is the mass of the electron,
$k$ is Boltzmann’s constant,
$T_c$ is the cathode temperature,
$B$ is the strength of the confining magnetic field.
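
To give a feeling for the magnitudes involved, the following sketch evaluates a thermal spiral radius of the form $1/a = (mkT_c)^{1/2}/(Be)$ for several field strengths. The exact numerical factor in the expression is an assumption here (a factor of order unity may differ), and the function name and sample fields are hypothetical; the cathode temperature of 1350° K is that quoted later for the oxide cathode.

```python
import math

M_E = 9.109e-31     # electron mass, kg
K_B = 1.381e-23     # Boltzmann's constant, J/K
Q_E = 1.602e-19     # electron charge, C
GAUSS = 1.0e-4      # one gauss expressed in tesla


def spreading_radius_magnetic(t_cathode_k: float, b_tesla: float) -> float:
    """Assumed form 1/a = sqrt(m k T_c) / (B e); result in metres."""
    return math.sqrt(M_E * K_B * t_cathode_k) / (b_tesla * Q_E)


# Hypothetical survey of field strengths for a 1350 K cathode.
for b_gauss in (150.0, 300.0, 700.0):
    radius_m = spreading_radius_magnetic(1350.0, b_gauss * GAUSS)
    print(f"B = {b_gauss:5.0f} gauss  ->  1/a = {radius_m * 1e3:.4f} mm")
```

The radius varies inversely with the field, so weakening the field from 700 to 150 gauss enlarges the spreading radius by the same ratio.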

If no magnetic field exists, the mean spreading radius is determined by the
transit time $\tau$ from the cathode to the plane of interception. For a parallel-
flow, Pierce-type electron gun there results


(9)    $\dfrac{1}{a} = \tau\left(\dfrac{kT_c}{m}\right)^{1/2}$.


Of most interest to the designer of beam-type tubes is the magnitude of 
interception noise for small amounts of current intercepted by an aperture 
near the beam edge. In this case it is unrealistic to assume that the beam 
has a rectangular current-density profile since, with a finite confining field, 
the spiralling of electrons near the edge of the beam tends to “smear out” 
the edge. It may be shown (McIntosh 1958) that the radial dependence of 
current density is given by 


(10)    $J(r) = \tfrac{1}{2}J_0\{1 - \mathrm{erf}[a(r - r_b)]\}$

where $J_0$ is the average current density given by

$J_0 = \dfrac{I_{01}}{\pi r_b^2}$.


Using this expression, modified values of $Q_2$ have been calculated and are
compared in Table I with values given by Beam (1955).


TABLE I

Calculated values of $Q_2$ modified to take into account radial variation of current density

$ar_a$    $(r_a/r_b) - 1$    $Q_2$ modified    $Q_2$ Beam
50        -0.01              1.0×10⁻²          1.3×10⁻²
           0                 0.59              0.80
           0.01              0.24              0.35
20        -0.025             2.5               2.9
           0                 1.5               2.0
           0.025             0.59              0.80
10        -0.05              5.0               6.0
           0                 2.9               3.9
           0.05              1.2               [illegible]


Although this first-order correction yields a reduction in $Q_2$ of only 1 to
2 db, it does indicate that the thinning-out of current density near the beam 
edge tends to reduce the effect of interception. 


Application to Mesh Grids

Beam’s theory may be extended to mesh grids in the following manner. 
The electron beam may be divided into a bundle of smaller beams each of 
which is associated with one opening in the grid. The fraction of current 
intercepted from one of the small beams is equal to the fraction intercepted 
from the whole stream provided the current density is reasonably uniform 
over each of the small beams. A smoothing factor $\Gamma_2^2$ may be calculated for
one of the small beams in the same manner as for a single aperture, and
since $\Gamma_2^2$ is a measure of noise per unit current, the value thus calculated
applies to the whole beam. This analysis neglects possible interaction among 
the small beams, i.e., the fact that the small beams have no “edge’’, and 
that the radii of spiralling electrons may be large enough to overlap several 
grid modules. Such effects should be small if the mean spreading radius 1/a 
is less than the radius of one of the grid openings. When 1/a becomes greater 
than this radius, the excess noise will approach the North value. 


III. APPARATUS 
The Mechanical System 

Figure 2 (a) is a photograph of the apparatus used to make interception- 
noise measurements. The important and central portion of the apparatus is 
a vacuum chamber, maintained at a pressure of 3X10-§ mm of mercury, in 
which an electron gun and an intercepting electrode could be moved longi- 
tudinally with respect to a resonant cavity mounted on the right end of the 
long brass chamber. The large solenoid, mounted on wheels and capable of 
being rolled over the vacuum chamber, served to collimate the electron 
beam. The rack-like table was used to mount electronic apparatus. 

Figure 2(b) is an exploded view of the components within the vacuum 
chamber; the outer vacuum jacket has been removed. The electron gun and 
intercepting electrodes were mounted on frames which slid on teflon inserts 
along the inner surface of the brass tube seen at right. Over a longitudinal 
distance of 40 centimeters in which the motion took place, the lateral deviation 
of the sliding structures was less than 0.07 millimeters. 

The intercepting electrodes used were three apertures—of diameters .080 in., 
.060 in., .040 in.—and two mesh grids. The grids were a standard type used 
by Varian Associates in the manufacture of klystrons. One of the grids was 
mounted perpendicular to the beam axis, the other at an angle of 20° in order 
to intercept a larger fraction of the beam current. All electrodes were assembled 
on a single plate which pivoted on the plunger controlling the longitudinal 
motion. By rotating the plunger, any one of the electrodes could be moved 
into the path of the beam. 

The resonant cavity was a standard re-entrant type with a resonant fre- 
quency of 3050 Mc/s. Mounted directly behind the cavity, on the outside 
of the circular brass plate seen in Fig. 2(b), is the electron collector. It was
surrounded by a soft-iron magnetic shield in order to eliminate the spurious 
noise caused by reflected secondary electrons passing back through the cavity 
gap (Kornelsen 1957). 

The electron gun initially used in the noise measurement followed a design 
originated by Kornelsen (1957) especially for use in demountable vacuum 
systems. It employed an indirectly heated tantalum emitter which was not 
subject to contamination at the moderate vacuum (10° to 10~*mm Hg) 
attainable in a demountable system. Nor was it affected by repeated exposure 
to atmospheric pressure when changes had to be made in the system. For 
measurements of the variation of interception noise with magnetic field 
strength, an electron gun with an oxide-coated cathode was used since the
bombarded-cathode gun did not operate satisfactorily in magnetic fields of
less than 700 gauss. At low field strengths the bombarding beam was so
divergent that insufficient current was delivered to the tantalum button to
achieve space-charge limitation in the primary gun.

Fig. 2. (a) A photograph of the apparatus used to measure interception noise.
(b) An exploded view of the internal components of the tube structure.

Except for the cathodes, both guns were of identical, parallel-flow, Pierce-
type design (Pierce 1949). The experimental perveance was $1.1 \times 10^{-7}$ (amp)
(volt)$^{-3/2}$. Cathode temperatures for space-charge-limited emission were
2300° K for the tantalum cathode and 1350° K for the oxide cathode.
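
For orientation, the perveance fixes the space-charge-limited beam current through the three-halves-power law $I = PV^{3/2}$. The short sketch below evaluates this for the quoted perveance at several gun voltages; the function name and the particular voltages printed are illustrative assumptions, not tabulated data from the experiment.

```python
def space_charge_limited_current(perveance: float, volts: float) -> float:
    """Three-halves-power law: I = P * V**1.5 (amperes)."""
    return perveance * volts ** 1.5


PERVEANCE = 1.1e-7   # amp / volt^(3/2), the experimental value quoted above

# Illustrative gun voltages spanning the range used in the measurements.
for gun_volts in (200.0, 700.0, 1000.0):
    current_ma = space_charge_limited_current(PERVEANCE, gun_volts) * 1e3
    print(f"V = {gun_volts:6.0f} volts  ->  I = {current_ma:.3f} ma")
```

Because the beam was gated by a symmetrical square wave, the average collector current would be expected to be about half of the peak value computed in this way.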


The Measuring System 

The basic components of the measuring system are shown in Fig. 3. The 
electron beam was gated by a symmetrical square wave at 35 c/s. Noise
power coupled from the electron beam to the resonant cavity was amplified 
by a conventional superheterodyne receiver and, following audio detection 
and amplification, was detected coherently. McFarlane (1958) has analyzed 
this system and shown that the limiting sensitivity is a signal 35 db below 
the self-noise of the receiver. The measurement of shot-noise power in a 
beam current of 1 µa has been readily achieved.

Fig. 3. The electronic measuring apparatus.


IV. MEASUREMENTS


In the first series of measurements for which the apparatus was used, the 
interception plate and the cavity were located at the anode of the electron 
gun. Noise smoothing, before and after interception by the various electrodes, 
was obtained as a function of electron gun voltage at a fixed value of magnetic 
field strength. The variation of smoothing with magnetic field strength was 
measured for a constant value of gun voltage. 

The investigation of the effect of interception on the noise space-charge 
waves in the drift space consisted in measurements of noise power as the 
electron gun was moved back from the cavity, with the interception plate 
at various positions between gun and cavity. These measurements were 
carried out for the condition of space-charge-limited emission and for
temperature-limited emission.


Calibration of the Measuring System 
The apparatus was calibrated by a method used by Cutler and Quate
(1950). With the cavity at the anode of the electron gun, relative noise
power was measured as the beam current was increased from zero to its maxi-
mum value at space-charge limitation. Typical curves of noise power versus
collector current which resulted from measurements using the bombarded-
cathode gun are shown in Fig. 4. In the region of temperature-limited current
the noise power rose linearly with collector current as predicted theoretically
by the shot-noise relation,

$\overline{i^2} = 2eI\Delta f$.


Fig. 4. Typical curves of the variation of noise power with collector current.
Ordinate: relative noise power; abscissa: average collector current (µa).
Tantalum cathode; gun voltage = 700 volts; magnetic field = 700 gauss.

Curve    Interceptor          % Interception
1        .080-in. aperture    0
2        Flat grid            22
3        Tilted grid          50
4        .040-in. aperture    43

The onset of smoothing, as the cathode temperature was increased further,
may be clearly seen from the curves. At the maximum value of space-charge-
limited current, the ratio of measured noise power to the equivalent shot-
noise power determined by extrapolating the linear portion of the curve to
the same current yields the smoothing factor $\Gamma^2$.
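
The extrapolation procedure just described can be sketched as follows. A straight line through the origin is fitted to the temperature-limited points, extended to the space-charge-limited current, and compared with the noise power measured there. The data values and function names are hypothetical, invented only to show the arithmetic; they are not readings from Fig. 4.

```python
import math
from typing import Sequence


def shot_noise_slope(currents: Sequence[float], powers: Sequence[float]) -> float:
    """Least-squares slope of power = slope * current through the origin."""
    numerator = sum(i * p for i, p in zip(currents, powers))
    denominator = sum(i * i for i in currents)
    return numerator / denominator


def smoothing_factor(tl_currents: Sequence[float], tl_powers: Sequence[float],
                     scl_current: float, scl_power: float) -> float:
    """Measured power at the space-charge-limited current divided by the
    shot-noise power extrapolated linearly to the same current."""
    slope = shot_noise_slope(tl_currents, tl_powers)
    return scl_power / (slope * scl_current)


# Hypothetical calibration run (currents in µa, powers in relative units).
tl_currents = [10.0, 20.0, 50.0, 100.0]
tl_powers = [1.0, 2.0, 5.0, 10.0]
gamma_sq = smoothing_factor(tl_currents, tl_powers,
                            scl_current=1000.0, scl_power=10.0)
print(f"smoothing factor = {gamma_sq:.3f}")
print(f"smoothing = {-10.0 * math.log10(gamma_sq):.1f} db below shot noise")
```

Forcing the fitted line through the origin reflects the shot-noise relation, in which the noise power is strictly proportional to the current.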

Measurements of the variation of interception noise with magnetic field 
strength were normalized to shot noise by means of smoothing curves obtained 
for a fixed value of magnetic field. Similarly, in the case of measurement of 
noise space-charge waves along the electron beam, the relative noise power 
levels of the maxima and minima were referred to the level at the anode 
which had been determined with respect to shot noise from the smoothing 
measurements. 


V. EXPERIMENTAL RESULTS AND DISCUSSION 


(a) Interception Noise as a Function of Magnetic Field Strength 
The measurements to be described here were made with the emission from 
the oxide-coated cathode space-charge-limited at an anode voltage of 700 volts. 
Figure 5 shows the measured values of beam noise after interception as a
function of the strength of the confining magnetic field. Ordinate values
represent the noise content of the transmitted beam normalized to full shot
noise in the transmitted current. In the case of the circular apertures, the
amount of current transmitted was a strong function of the value of the
magnetic field. The number associated with each experimental point in Fig. 5
is the per cent transmission at that field strength. As noted on the graph, the
transmission factor of the grids was independent of magnetic field strength.

Fig. 5. The variation of noise smoothing after interception as a function of the strength
of the confining magnetic field. Ordinate values are noise power normalized to full shot noise
in the transmitted current.

The striking difference between the amount of noise added to the beam by
grid interception and that added by aperture interception is evident from 
these measurements. For comparable fractions of current intercepted the 
magnitude of the smoothing is from 3 to 6 db less for grid interception than 
for aperture interception. 

For the purpose of comparison with theory, the values of the smoothing 
factor ($\Gamma_2^2$ = noise smoothing in the beam after interception) predicted by
Beam’s theory have been plotted on the graph for each of the intercepting
electrodes. The measured values of $\Gamma_2^2$ are in qualitative agreement with the
functional variation predicted by the theory of Beam for magnetic field 
strengths greater than 150 gauss. The magnitude of the smoothing measured 
for the flat grid and for the .040-in. aperture lies within 1 db of the predicted 
value. The theoretical smoothing is consistently low in the case of the tilted 
grid, and high for the larger apertures. In the latter cases, modifying the 
theory to take into account the radial variation of current density has slightly 
improved the agreement between theory and experiment. The direct-current 
density in the actual beam was found to decrease much more slowly towards 
the beam edge than is predicted by theory. This may account in part for the 
discrepancy between the observed and predicted values of interception noise 
in cases where the beam is intercepted near the edge. 

Concerning the magnitude of the interception noise produced by the mesh 
grids it is interesting to note that for field strengths in the neighborhood of 
150 gauss the noise approaches or exceeds the North values plotted as curves 
Av and Bg in Fig. 5. At these values of magnetic field, the theoretical mean 
spreading radius is nearly equal to the radius of one grid module. In the 
actual physical beam, small d-c. perturbations in the electron flow, due to 
imperfect electrostatic focussing fields in the electron gun, probably increase 
the spreading radius. Hence it would be expected from theoretical arguments 
that the noise should approach the North value. 

The increase in the mean spreading radius due to transverse velocity other 
than that of thermal origin is believed to account for the fact that, for strong 
magnetic fields, the experimental values of $\Gamma_2^2$ for the grids are higher than
the theoretical values. 

From Fig. 5 it may be seen that the experimental values of smoothing for 
each of the intercepting electrodes decreased as the strength of the magnetic 
field decreased from about 150 gauss to zero. This is a region of transition 
from a mean spreading radius determined by the magnetic field to a mean 
spreading radius determined from transit time across the gun. The values of 
smoothing predicted from transit time considerations are in qualitative agree-
ment with the experimental measurements but are in error by 1.5 db for the
tilted grid and the .040-in. aperture.


(b) The Variation of Interception Noise with Gun Voltage 

The significant conditions of the experiment were: interception plate and 
cavity at the anode of the tantalum-cathode gun, magnetic field constant 
at 700 gauss. Figure 6 shows the collected results of smoothing measurements,
similar to those indicated in Fig. 4, for gun voltages from 200 volts to 1000
volts. The small amount of excess noise caused by aperture interception is
again apparent. For 5% interception by the .060-in. aperture no change in
smoothing was detectable within the precision of the measurements, which
was ±0.2 db. The .040-in. aperture, which intercepted 45% of the beam
current, caused less than 0.5-db decrease in smoothing.

Fig. 6. The variation with gun voltage of noise smoothing after interception.

The results of a comparison of the experimental values of $\Gamma_2^2$ with the
values predicted by the theories of Beam and North are not significantly 
different from those of the previous section. 


(c) Space-Charge Waves on an Intercepted Beam 

(i) Space-Charge-Limited Emission

Using the tilted grid and the .040-in. aperture as intercepting electrodes, 
noise was measured as the electron gun was moved away from the cavity. 
This was done for two cases: first, with the intercepting electrode at, and 
moving with, the anode of the electron gun; second, with the intercepting 
electrode stationary at the cavity. Figure 7 shows the maxima and minima 
of the noise space-charge waves for a gun voltage of 700 volts. Ordinate values
are $\Gamma_2^2$ = noise power relative to full shot noise in the transmitted current. It
is to be emphasized that this graphical representation of the levels of the
maxima and minima does not show the complete form of the space-charge
waves, nor the fact that the wavelength is different in each case.

Fig. 7. Maxima and minima of the noise space-charge wave in the drift space for various
conditions of interception.

Curve    Interceptor          Position     % Interception
1        .080-in. aperture    -            0
2        .040-in. aperture    At anode     45
3        .040-in. aperture    At cavity    45
4        Tilted grid          At anode     50
5        Tilted grid          At cavity    50


The .040-in. aperture, intercepting 50% of the beam current at the anode 
(curve 2), had only a small effect on the space-charge wave. The level of the 
minima was increased by approximately 2 to 4 db in 20 db. In the case of
grid interception at the anode (curve 4), the over-all noise level was much 
higher, but the ratio of maximum power to minimum power was still large. 
The standing-wave ratio was in fact greater than that for the first minimum 
of the unintercepted beam (curve 1). 

When the beam was intercepted at the cavity (curves 3 and 5) the standing- 
wave ratio was greatly reduced. With the tilted grid at the cavity, the standing- 
wave ratio was of the order of 0.5 db as compared with 10 db for the grid at 
the anode. 

The gross effects of interception on the space-charge wave can be explained 
on the basis of the well-known theory of space-charge waves. At the inter- 
cepting electrode a new source of noise is created which establishes its own 
standing wave in the drift space. The total space-charge wave is the sum of 
the wave due to interception and the wave due to the noise in the beam 
before interception. The phase of the interception-noise wave is such that it 
always has a maximum at the intercepting electrode. As long as the plane
of interception occurs at a maximum of the standing wave in the unintercepted 
beam, the resultant standing wave still exhibits deep minima. When the 
interception takes place at the cavity, the noise due to interception is a 
maximum and almost completely masks the space-charge wave due to initial 
noise. 

(ii) Interception Noise on a Temperature-Limited Beam

Measurements of the variation of noise power with collector current such 
as those illustrated by Fig. 4 showed that, for a given value of collector 
current in the temperature-limited region, the noise power measured at the 
anode was not increased by interception. That is, the noise at the anode is 
full shot noise. Beyond the anode there are minima of the space-charge wave 
where the noise is less than full shot noise. 

The noise level at the first standing-wave minimum in a temperature- 
limited beam was measured for various conditions of interception. A constant 
collector current of 50 µa was used. The cathode current was larger than
this depending on the particular intercepting electrode used. Results are 
listed in Table II. 


TABLE II

Interception noise on a temperature-limited beam. $\Gamma_2^2$ is the noise level of the first standing-
wave minimum in decibels below shot noise. Gun voltage = 700 volts; collector current =
50 µa; magnetic field = 700 gauss

                                        $\Gamma_2^2$ experimental
                     Percentage        Interceptor    Interceptor
Interceptor          interception      at anode       at cavity
.080-in. aperture    Zero              12.7           12.7
.060-in. aperture    15*               13.0           [illegible]
.040-in. aperture    57                14.0           5.0
Tilted grid          45                11.5           3.0
Flat grid            20                12.2           5.5

*These values are approximate averages which take into account large fluctuations in intercepted current due
to scalloping on the beam.


These measurements provide conclusive evidence that interception noise 
can arise from a smoothing process external to the electron gun. The reduction 
of noise at the minima of the space-charge wave results from interactions 
which are completely independent of processes in the electron gun. 

The magnitude of the interception noise measured at the space-charge-wave 
minimum was considerably higher than that measured at the gun anode for 
space-charge-limited emission. Consideration of the behavior of a Pierce-type 
electron gun for low current densities has shown that there exists a highly 
distorted focussing field which produces excess transverse velocity. The con- 
sequent large increase in the mean spreading radius accounts for the higher 
values of interception noise. The experimental values of smoothing for the 
two grids are within 0.4 db of the North-theory value, which is indicative 
of the fact that the mean spreading radius was much larger than the effective 
radius of one grid module. 


VI. CONCLUSIONS

The interception by a circular aperture of a fraction of the current in an 
electron stream causes little increase in noise when the electron beam is 
confined by a strong magnetic field. Fifty per cent interception produced a 
decrease of only 0.5 db in approximately 10 db of smoothing. A mesh grid 
caused decreases in smoothing as great as 7 db for an equivalent fraction 
of current intercepted. 

The theories of North and Beam as modified by the author form a satis- 
factory basis for the prediction of the amount of interception noise caused 
by circular apertures and mesh grids. Perturbations in the electron flow, in 
particular the radial variation of direct current density, and excess transverse 
velocity due to imperfections in focussing fields have considerable effect on 
interception noise. 

Interception of current excites a standing wave of noise, along a drifting 
electron stream, that has a maximum of noise power at the plane of inter- 
ception. Thus, in beam-type microwave tubes, interception at the electron 
gun anode may be of little consequence, whereas interception by electro- 
magnetic structures such as a resonant cavity may seriously degrade the 
noise performance of the tube. 



FIRST ADSORBED LAYER OF HELIUM AT 4.2° K¹


J. P. Hobson


ABSTRACT 


An attempt has been made to measure the helium adsorption isotherm on 
pyrex (Corning 7740) at 4.2° K for pressures above the adsorbed layer ranging 
from 10~4 to 1072 mm Hg. New ultrahigh vacuum techniques were used for the 
pressure measurements with geometric adsorbing areas from 10 to 70 cm².
Above 10~® mm Hg the results give a true measurement. Below 10~-° mm Hg the 
results give an upper limit on the pressure above the adsorbed layer. The form 
of the isotherm indicates that the first adsorbed layer is complete at a pressure 
<10-° mm Hg. A comparison of the present results with previous measure- 
ments of the helium adsorption isotherm at 4.2° K, which extend down to 10-6 
mm Hg, suggests that all such measurements with solid adsorbents have been 
made on helium layers thicker than one monolayer. This conclusion supports 
similar conclusions drawn from heats of adsorption data at higher temperatures. 
An extrapolation of the results suggests that total gaseous densities approach- 
ing those of interstellar space (1.0X10-*%* g/cm’) should be possible in the 
laboratory, with liquid helium trapping. 


I. INTRODUCTION 


Adsorption isotherms of helium have been measured by many authors for 
various adsorbents (Keesom and Schmidt 1933; Stout and Giauque 1938;
van Itterbeek and van Dingenen 1938; van Itterbeek, van Dingenen, and
Borghs 1939; Keesom and Schweers 1941a, b; Kistemaker 1947; Long and
Meyer 1949; Schaeffer, Wendell, and Smith 1949; Frederikse and Gorter 
1950; White, Chien, and Johnston 1952; Bowers 1953; Brewer and Mendels- 
sohn 1953; Tjerkstra, Hooftman, and van den Meydenburg 1953; Aston, 
Mastrangelo, and Tykodi 1955; Strauss 1956; Meyer 1956; Hoffman, Edeskuty, 
and Hammel 1956). The measurements taken together have covered a pressure 
range from near-atmospheric pressure to 10-* mm Hg and a temperature 
range from 20° K to 1.2° K. Yet until recently the application of the available 
theories of adsorption left two basic questions unsatisfactorily answered: 

(a) What is the density of atoms in the first adsorbed layer? 

(b) At what pressure is this layer first complete?

These two questions were not answered satisfactorily even above the 
λ-point in a region where simple theories of adsorption might be expected to
apply. The questions are related but (a) has been examined more extensively 
in the literature because the application of the well-known B.E.T. theory 
(Brunauer, Emmett, and Teller 1938) to the data yielded a first layer helium 
density that was anomalously high (Long and Meyer 1949; Band 1949; Aston 
and Mastrangelo 1951; Long and Meyer 1953; Atkins 1954; Singh and Band 
1955). 

¹Manuscript received November 10, 1958.

Recently Steele (1956) and Meyer (1956) have independently proposed
that the heat of adsorption of the second layer, although an order of magnitude
smaller than that of the first layer, remains larger than the heat of vaporization,
and as a consequence the standard B.E.T. theory has tended to give the 
density of two layers rather than one. The bulk anomaly in the first layer 
density is thus removed. A consequence of this new proposal is that the first 
layer should be complete at extremely low pressures. Meyer (1956) examining 
his low-pressure data for gold concluded that the first layer is complete at 
P/P ) ~ 1077 at 2° K and at P/P» > 1077 at 4.2° K. Steele (1956) does not 
treat (b) explicitly, but the question may be answered theoretically for TiO₂ at
4.2° K by combining the isotherm discussed by Steele with the force constants 
obtained experimentally by Aston, Mastrangelo, and Tykodi (1955). The 
same answer is obtained by using these constants with the standard B.E.T. 
isotherm or the Langmuir isotherm (1918), but because of the greater generality 
of the isotherm examined by Steele we will use it here. This isotherm equation
is:
: ) 1+2x | 2=55 ' cx 
(1) = a a ee 
7" bieeee 
1—x 


where $v$ is the quantity adsorbed, $v_m$ is the quantity adsorbed in a monolayer,
$x = P/P_0$ = the saturation ($P$ is the equilibrium pressure above the ad-
sorbed layer, $P_0$ is the equilibrium pressure above the bulk liquid at the same
temperature; $P_0$ = 760 mm Hg for helium at 4.2° K), $z$ is a constant connected
with the energy of adsorption of the second layer, $c$ is the usual B.E.T. con-
stant connected with the energy of adsorption of the first layer,


(2)    $c = \exp\left[\dfrac{E_1 - E_{vap}}{RT}\right]$


where $E_1$ is the heat of adsorption of the first layer, $E_{vap}$ is the heat of vaporiza-
tion, $R$ is the universal gas constant, and $T$ is the absolute temperature.
Aston, Mastrangelo, and Tykodi (1955) find $E_1$ to be 700-200 cal/mole.
We have arbitrarily assumed $E_1$ = 300 cal/mole for calculation. From (2)
this gives c = e® = 10" at 4.2°K. Taking 2, = .33 cm’ S.T.P./m? and 
$z = e^4$ as given by Steele (1956) we plot equation (1) in Fig. 1. The knee at
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Fic. 1. Calculated helium adsorption isotherm for TiOz at 4.2° K. Curve gives equation 
(1) with v, = .33 cm3/m?, c = e* = 10%, 2 = ef = 55. 
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x = 10~" (i.e. 1/c) represents the completion of the first layer. A non-constant 
$E_1$, as may be expected in practice, will make the knee less clear but will not
change its location, which lies well below the pressure range which has been 
studied by the authors quoted above. 
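
The order of magnitude of the B.E.T. constant, and hence of the saturation $x = 1/c$ at which the knee occurs, can be checked with a few lines of arithmetic. The sketch below evaluates $c = \exp[(E_1 - E_{vap})/RT]$ with $E_1$ = 300 cal/mole at 4.2° K. The heat of vaporization of about 20 cal/mole used here is an assumed round figure for liquid helium, not a value quoted in the text, so the result should be read only as an order-of-magnitude estimate; the function name is likewise illustrative.

```python
import math

R_CAL = 1.987   # universal gas constant, cal / (mole K)


def bet_exponent(e1_cal: float, e_vap_cal: float, temp_k: float) -> float:
    """Exponent (E_1 - E_vap) / (R T) of the B.E.T. constant c."""
    return (e1_cal - e_vap_cal) / (R_CAL * temp_k)


E1 = 300.0      # cal/mole, the value assumed in the text
E_VAP = 20.0    # cal/mole, ASSUMED approximate heat of vaporization of helium
T = 4.2         # degrees K

exponent = bet_exponent(E1, E_VAP, T)
log10_c = exponent / math.log(10.0)
print(f"c = exp({exponent:.1f}), i.e. about 10^{log10_c:.1f}")
print(f"knee expected near x = 1/c, about 10^-{log10_c:.1f}")
```

Because $E_1$ enters through an exponential with $RT$ of only about 8 cal/mole, a change of 20 cal/mole in either heat shifts the knee by roughly one decade in saturation.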

The measurements of Hobson and Redhead (1958) on the operation of an 
inverted-magnetron pressure gauge involved adsorption of helium on pyrex 
at 4.2° K at saturations near 10~'®. However, in these experiments the quantity 
of helium available for adsorption was limited to that diffusing through the 
pyrex envelope (Rogers, Buritz, and Alpert 1954) and was too small to yield 
a monolayer on any measurable area. A simple modification of the apparatus 
for the introduction of larger amounts of helium promised at least an explora- 
tory measurement of the helium adsorption isotherm on pyrex at 4.2° K for 
saturations from 10-7 to 10~-", thus lowering the limit of previous measure- 
ments by six orders of magnitude, and providing an experimental check on 
the major feature of Fig. 1, namely the very small value of x at which the 
monolayer is complete. For physical adsorption this feature is expected to be 
approximately independent of adsorbent. 

This paper describes such a measurement. 


II. APPARATUS 
The apparatus used is shown in Fig. 2. The envelope was pyrex glass
(Corning 7740). The valves were ultrahigh vacuum valves (Alpert 1953;
Bills and Allen 1955). The Bayard-Alpert gauges (B.A.G. No. 1, No. 2) were
of standard design and were operated with collector at ground potential
(i.e. 0), filament +45 v, grid +300 v, and grid current either 8 ma or 80 µa.
The main adsorption volume $V_1$ (915 cm³) was to the left of valve A and
contained a magnetron pressure gauge (M.G.) (Redhead 1958), a B.A.G.
(No. 1), and a finger which could be immersed in liquid helium at 4.2° K in
a simple strip-silvered dewar.

Fig. 2. Apparatus.

V₁ was similar to the sealed-off systems used by Hobson and Redhead
(1958) except that in the present experiments valve A permitted control of
helium flow into V₁. Valve C controlled flow from the gas source of
spectroscopically pure helium to the central volume V₂ where pressure P₂ was
measured by B.A.G. No. 2. V₂ could be pumped through valve B with a
mercury diffusion pump suitably trapped with liquid nitrogen.

The system was prepared for measurements by baking the region shown
in Fig. 2 for 15 hours at 400 to 450° C with all valves open and with the
mercury diffusion pump running. Before measurements were begun, pressures
of 5X10-" mm Hg were obtained in V₁ with A closed, and 2X10-" mm Hg in
V₂ with C closed. It was found that diffusion of helium through the pyrex
break-seal from the source flask at approximately atmospheric pressure was
sufficient to raise P₂ to 10⁻⁷ mm Hg with C open, and to permit part of the
experiment to be performed without breaking the seal.

In all liquid helium immersions the dewar position was fixed and the liquid
level was permitted to fall by natural evaporation after transfer of about
170 cm³ of liquid helium. A transfer lasted about one hour. The liquid level
in contact with graduations on the finger could be observed visually
immediately after transfer, but clouding in the dewar, which was sealed at the
top with a styrofoam cap and glass wool, sometimes made visual observations
unreliable later. A single loop of .002-in. tantalum wire was wrapped
horizontally around the finger about 1 in. from the tip. Superconductivity was
not obtained, probably because of unsuitable annealing conditions (Webber
1947), but with 30 ma direct current the resistance rose by 50% when the
level fell below the wire, and a further 25% when the level left the tip of the
finger. The latter event was also recognizable by a marked change in the rate
of desorption of gas from the finger. By combining these data the helium
level could be estimated at any time during a run. Attempts to maintain a
fixed helium level by continuous raising of the dewar were not successful.

In this paper, pressures at the gauges are quoted as equivalent nitrogen 
pressures, and those above the adsorbed layer as true helium pressures, 
unless otherwise stated. 

III. EXPERIMENTAL PROCEDURE 
(1) Calibration of Magnetron Gauge at Very Low Pressures 

In the main adsorption measurements, the M.G. was required to measure 
pressures below those measurable with the B.A.G. (lower reliable limit of 
B.A.G. No. 1 was about 3X 10~-' mm Hg because of uncertainties in its X-ray 
limit), and in this range a new calibration procedure was used which did not 
simply check the magnetron gauge ion current (I_MG) against P₁ given by
B.A.G. No. 1, as was done at higher pressures. 

Valve A was closed and the finger immersed in liquid helium, with B.A.G.
No. 1 turned off to eliminate any possible contaminants from its hot filament.
I_MG fell to a low value I₀ ≈ 2X10-" amp corresponding to an unknown
pressure P₀ (see Section V(a)) which was steady except for noise (signal-to-
noise ratio about 3:1). With P₂ set to a steady helium pressure of about 10~*
mm Hg, valve A was cracked open and thereafter left untouched to permit a
steady helium flow Q to enter V₁. The relations believed to determine P₁ are
given in Fig. 3, which shows a simplified sketch of Fig. 2. Here k is the
conductance of valve A, S_MG and S_He are the pumping speeds of the M.G. and
the liquid helium trap respectively, and P_He is the pressure at the bottom of
the finger. (The validity of the remark that P_He = 0 can be checked later
from the adsorption results.) The values of I_MG and P₁ being discussed are
values in excess of I₀ and P₀ and thus result from Q.

Fig. 3. Principle of gauge calibration at very low pressures. Calibration experiment:
plot I_MG vs. P₂.


With A set, P₁ was proportional to P₂ and relative values of P₁ were
obtained by adjusting P₂ with valve C. P₂ was readily measured with B.A.G.
No. 2 and the major portion of the calibration of the M.G. consisted in plotting
I_MG against P₂. As a general experimental check on the validity of the analysis,
it was found that for at least 10 minutes after a change in P₂, I_MG remained
time-independent, showing S_He to be sensibly independent of the liquid helium
level. The value of the constant k/S_He could be obtained when P₁ was sufficiently
high to be measured directly by B.A.G. No. 1.

The results of three such measurements are shown in Fig. 4. The values of
P₂ shown are not exactly those observed, since the relation between I_MG and
P₂ depended on the setting of valve A which was not exactly reproduced in
each run. However, in each run P, = 10% P; as shown. It can be seen that 
the near-linear law between Jyg and P, breaks down at a current of 1.1 107° 
amp, corresponding to a pressure of 4X 10—-" mm Hg. 

Sudden changes in the operating characteristics of magnetron-type gauges
have been observed (Conn and Daglish 1954), but in the present case the
question arose whether the break in Fig. 4 might possibly result from the
calibration method itself, which was new. The method was applied to B.A.G.
No. 1 and no break was found down to P₁ = 1X10-" mm Hg. The M.G.
and B.A.G. No. 1 were tested simultaneously and the break was present in
the M.G. characteristic but not in the B.A.G. characteristic, although the
angle of 59° in Fig. 4 shifted to 54°. Next, the experiment was repeated with
the M.G. operating with 5 kv between plate and cathode, but with 2060 gauss
rather than 1060 gauss. The resulting calibration curve was quite different
from that of Fig. 4. None of these data suggested any inherent error in the
method of calibration, but all indicated that the calibration result was specific
to M.G. operating conditions. The method of calibration was adopted as
valid, and the low-pressure adsorption measurements were made with the
M.G. operating conditions the same as those used during the calibration runs
of Fig. 4.

Fig. 4. Result of magnetron gauge calibration.


(2) Adsorption Measurements 

The objective of these measurements was to establish the equilibrium 
pressure above a layer of helium of known density per unit area, adsorbed on 
the inside of the pyrex finger. 

Large adsorbed quantities were prepared by adjusting P₂ to a steady calculated
value (B.A.G. No. 2 pumping very slowly with grid current 80 µa) with
A, B, C closed. The finger was then immersed in liquid helium, producing
conditions similar to those at the start of a calibration run. Valve A was next
opened and the gas in V₂ allowed to be adsorbed in the finger. Valve A was
then closed. After a few minutes the gauge current became constant except
for noise and a slow increase caused by the falling liquid level. This pressure
reading was converted to that above the adsorbed layer by corrections given
below. When the pressure began to rise rapidly, indicating large desorption
of gas from the finger as a result of disappearing liquid, the M.G., if operating,
was switched off to prevent ionic pumping of the desorbing helium. When all
liquid helium was gone, the pressure was measured by B.A.G. No. 1 operating
at 80 µa. From this pressure reading, after a small correction for finger
temperature, the quantity of gas adsorbed at any time could be deduced. This
pressure reading could be predicted from the initial value of
P₂ and the known volume ratio V₁/V₂, demonstrating that no helium was
adsorbed irreversibly. This result was obtained previously by Hobson and
Redhead (1958) for very small adsorbed quantities.
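
The bookkeeping of the warm-up step can be illustrated by a short sketch. It assumes the ideal gas law, a gauge temperature of 295° K, the 915 cm³ quoted for the main adsorption volume, and the helium sensitivity factor of .16 quoted in the section on pressure measurements. The function names and the sample reading are hypothetical and are not taken from the measurements, and the small correction for finger temperature is ignored.

```python
# Sketch: convert a B.A.G. reading taken after warm-up into adsorbed atoms.
# Assumptions (not from the paper): ideal gas, all helium desorbed into the
# main volume, finger-temperature correction neglected.

K_BOLTZMANN = 1.380649e-16        # erg/K
DYN_PER_CM2_PER_MM_HG = 1333.22   # 1 mm Hg expressed in dyn/cm^2
HELIUM_SENSITIVITY = 0.16         # gauge sensitivity to He relative to N2


def atoms_in_volume(p_true_mm_hg, volume_cm3, temperature_k=295.0):
    """Number of gas atoms from the ideal gas law, N = P V / (k T)."""
    p_cgs = p_true_mm_hg * DYN_PER_CM2_PER_MM_HG
    return p_cgs * volume_cm3 / (K_BOLTZMANN * temperature_k)


def adsorbed_density(p_gauge_n2_equiv, area_cm2, volume_cm3=915.0):
    """Atoms per cm^2 that had been adsorbed, from an equivalent-N2 reading."""
    p_true = p_gauge_n2_equiv / HELIUM_SENSITIVITY
    return atoms_in_volume(p_true, volume_cm3) / area_cm2


if __name__ == "__main__":
    # Hypothetical reading of 1.7e-5 mm Hg (equiv. N2), 40 cm^2 immersed.
    print(f"{adsorbed_density(1.7e-5, 40.0):.2e} atoms/cm^2")
```

With these assumptions a reading in the 10⁻⁵ mm Hg range corresponds to a surface density of the order of 10¹⁴ atoms/cm², which is the scale of the larger adsorbed quantities discussed in the results.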

For small adsorbed amounts, P2 was set to a fairly high value (~107° 
mm Hg) before A was opened partially for a short time and then closed. 
Experience permitted the quantity of gas so admitted to be judged quite 
accurately. This procedure was adopted for small quantities to maintain the 
ratio of helium to possible background gases in V2 at a high value. 

Certain assumptions concerning the pressure measurements and adsorbing 
areas were made which reduce the absolute accuracy of the results but which 
should yield valid relative results. 

(a) Pressure Measurements. Pressure measurements above the adsorbed
layer were made with the M.G. alone at pressures below 5×10⁻¹⁰ mm Hg
and with either or both gauges between this level and 10⁻⁷ mm Hg. Above
10⁻⁷ mm Hg errors might be expected from the high pumping speed of the
M.G. (~.13 liter/second for helium) so only B.A.G. No. 1 was used. Since
the gauges measuring P₁ were at room temperature (T₁ = 295° K) and the
desired pressures P were those at T = 4.2° K, a thermomolecular pressure
correction (Keesom 1942) was made according to the formula

(3)    $P / P_1 = \sqrt{T / T_1}$

The calibration of the B.A.G. design used, as given by the manufacturer in
equivalent nitrogen pressure, was adopted. However, ionization gauges are
less sensitive to helium than to nitrogen and for the same ionization current
the helium pressure is about 1/.16 times higher than the nitrogen pressure
(Alpert 1953). This figure was adopted to convert observed ion currents to
true helium pressures in the results presented below.
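
The two corrections described above can be sketched as follows. The sketch assumes that equation (3) is the usual thermal transpiration relation, in which the pressure at the cold end equals the gauge pressure multiplied by the square root of the temperature ratio. The function and constant names are illustrative only.

```python
import math

T_GAUGE_K = 295.0          # gauges at room temperature (value from the text)
T_COLD_K = 4.2             # temperature of the adsorbed layer
HELIUM_SENSITIVITY = 0.16  # He sensitivity relative to N2 (value from the text)


def n2_equiv_to_true_helium(p_n2_equiv):
    """Convert an equivalent-nitrogen gauge reading to true helium pressure."""
    return p_n2_equiv / HELIUM_SENSITIVITY


def gauge_to_cold_pressure(p1_true, t_cold=T_COLD_K, t_gauge=T_GAUGE_K):
    """Thermomolecular correction, assumed form P = P1 * sqrt(T / T1)."""
    return p1_true * math.sqrt(t_cold / t_gauge)


if __name__ == "__main__":
    p1 = n2_equiv_to_true_helium(1.0e-10)   # hypothetical gauge reading
    print(gauge_to_cold_pressure(p1))
```

Under this reading of equation (3), a true helium pressure of 8.4×10⁻¹⁰ mm Hg at the gauge corresponds to about 1×10⁻¹⁰ mm Hg above the adsorbed layer, since the square root of 295/4.2 is about 8.4.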

(b) Adsorbing Surface Area. The adsorbing area has been taken as the
simple geometric area of the inside surface of the finger (I.D. 2.2 cm, O.D.
2.5 cm) lying below the plane of the liquid helium surface. Adsorbing areas
ranging from 10 to 70 cm² were used in calculating the results below.


IV. RESULTS 

The results of 12 runs of the type described above are given in Fig. 5 where
the number of atoms adsorbed per cm² is plotted against the measured pressure
above the adsorbed layer, on log-log paper. The dotted line has been drawn
visually. Three regions are apparent.

(a) At a pressure P = 1.5×10⁻¹² mm Hg the points lie almost vertically
and there is very little scatter.

(b) At pressures between 1.5×10⁻¹² mm Hg and 5×10⁻¹⁰ mm Hg, the points
lie approximately on a line at 45° to the horizontal and there is considerable
scatter. The type of observation giving rise to this scatter is shown in Fig. 6,
which compares two typical records of I_MG for approximately the same
pressure: first, when the M.G. was measuring pressure with the finger immersed
in liquid helium, and second, when there was no helium adsorption. Points in
Fig. 5 were obtained from those in Fig. 6 by taking pressure readings at
1-minute intervals.

(c) Above 5X10-* mm Hg the points lie approximately on a line at 7° to
the horizontal and again there is comparatively little scatter.

Fig. 5. Adsorption isotherm measurements for helium on pyrex at 4.2° K. Each symbol
represents a different run. Solid symbols were obtained with magnetron gauge. Open symbols
were obtained with Bayard-Alpert gauge.

Fig. 6. Record of fluctuations in pressure above the adsorbed layer.

V. INTERPRETATION OF RESULTS 

The proposed interpretation of these three regions is as follows. 

(a) The limiting pressure 1.5×10⁻¹² mm Hg represents an instrumental
limit of the apparatus. An approximate calculation of the residual pressure
to be expected when the helium diffusing through the pyrex walls of V₁
flowed through the conductance of the finger gave an expected limit in Fig. 5
of 9.5×10⁻¹³ mm Hg. In this calculation the finger (2.2 cm I.D.) was assumed
to have, in series, 9 cm at 295° K, 5.5 cm at 190° K, and 5 cm at 40° K, and
to be terminated by a plane at 4° K. The calculation assumes that the
equilibrium pressure above the adsorbed layer may be neglected (Section V(b)).
The resulting trap pumping speed was 4.8 liters/second. The measured
helium diffusion rate was 4×10⁻¹⁰ mm Hg (equiv. N₂)/minute. Such high
values have been discussed previously by Hobson and Redhead (1958).
While the quantitative agreement between the observed and calculated
instrumental limits is not fully satisfactory, nevertheless it does show that
an instrumental limit is to be expected at roughly the pressure at which such
a limit is observed. If this instrumental limit is taken as 1.5×10⁻¹² mm Hg,
then this pressure is to be subtracted from all other measured pressures above
the adsorbed layer. The arrows in Fig. 5 show how a few points are shifted
when this is done. The points are shifted progressively less for higher pressures.
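
The estimate of the instrumental limit can be sketched numerically. The text does not spell out the individual steps, so the chain below is only one plausible reading: the measured pressure rise is converted to a helium throughput, divided by the trap pumping speed, and referred to 4.2° K by the thermomolecular relation. The pumping of the M.G. is neglected, and all function and parameter names are illustrative.

```python
import math


def instrumental_limit(rise_rate_n2_mm_hg_per_min=4e-10,
                       volume_liters=0.915,
                       trap_speed_l_per_s=4.8,
                       helium_sensitivity=0.16,
                       t_cold_k=4.2,
                       t_gauge_k=295.0):
    """Residual helium pressure (mm Hg) referred to the cold end of the finger.

    Assumed chain: diffusion throughput = (dP/dt) * V, converted from
    equivalent nitrogen to true helium, balanced against the trap speed,
    then multiplied by sqrt(T_cold / T_gauge).
    """
    rise_rate_per_s = rise_rate_n2_mm_hg_per_min / 60.0
    throughput = rise_rate_per_s * volume_liters / helium_sensitivity
    p_gauge = throughput / trap_speed_l_per_s
    return p_gauge * math.sqrt(t_cold_k / t_gauge_k)


if __name__ == "__main__":
    print(f"{instrumental_limit():.2e} mm Hg")
```

With the figures quoted in this section the sketch gives a value just under 10⁻¹² mm Hg, of the same order as the observed limit.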

(b) In this region between a pressure of 1.5×10⁻¹² mm Hg and 5×10⁻¹⁰
mm Hg the amount adsorbed is approximately proportional to the measured
pressure. If the heat of adsorption is independent of coverage, this result is
to be expected for a partial monolayer (see Fig. 1). The absolute value of
numbers of atoms adsorbed per cm² in this region is in general agreement with
this assignment. This interpretation was at first placed on region (b) (Hobson
1958).

However, the following instrumental considerations place some doubt
upon this interpretation. With an adsorbing area of 40 cm² corresponding
roughly to the midpoint of a run, the adsorbing area was decreasing at about
2% per minute because of the falling liquid level. If the adsorbed layer is
immobile (de Boer and Kruyer 1952), as seems likely from the considerations
of Section VI, then at least .02N atoms/minute are being released into the
gaseous phase from a region near the liquid level, where N is the total number
of adsorbed atoms.

Let us examine a model in which volume V₁ is imagined to be cut into two
by the plane of the liquid level. Let the upper volume be U, which is bounded
by the imaginary plane and all parts of V₁ not at 4.2° K (U ≈ 894 cm³).
Let the lower volume be L, which is bounded by the imaginary plane and all
parts of V₁ which are at 4.2° K. Further, let half the .02N atoms released per
minute be projected into U and the other half into L. Now assume that the
pressure above the adsorbed layer is zero; i.e., that all atoms entering L never
leave except by being released at the liquid level. U contains two pumps,
namely the M.G. and the imaginary plane. The pumping speed of the M.G. is
known to be about .1 liter/second, while the pumping speed of the plane of
area A at T = 4.2° K for helium (M = 4) is given by

(4)    $F = 3.638\,A\,\sqrt{T/M} = 13.8$ liters/second.
Thus the total pumping speed of U is 13.9 liters/second. Consider now a
point in region (b) of Fig. 5, say 8×10¹³ atoms/cm² adsorbed at P = 1×10⁻¹⁰
mm Hg. The pressure P₁ as measured at the gauge is obtained from
equation (3).

(5)    $P_1 = 8.4 \times 10^{-10}$ mm Hg (true helium pressure).

For an adsorbing area of 40 cm², N = 3.2×10¹⁵ atoms, giving a rate of
input into U of 3.2×10¹³ atoms/minute or an equivalent room temperature
leak rate of 1.7×10⁻⁸ mm Hg/second. The equilibrium pressure expected
with a pumping speed given by equation (4) is

(6)    $P_1 = [(.894)(1.7 \times 10^{-8})]/13.8 = 1.1 \times 10^{-9}$ mm Hg (true helium pressure).


The general agreement between the observed value (5) and the value (6)
calculated from a model in which the true pressure above the layer is negligible
suggests that the pressures of region (b) be taken only as upper bounds on the
true pressures above the adsorbed layer. The model, which is linear in N,
predicts a 45° slope to the observations and provides a qualitative explanation
for Fig. 6, since the gas causing the observed pressure originates at the liquid
level where fluctuations are probable.
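
The model calculation above can be retraced with a short sketch. It assumes the ideal gas law at 295° K for converting atoms to pressure, reads the sample point as 8×10¹³ atoms/cm² on 40 cm², and takes the area A as the bore cross-section of the 2.2 cm I.D. finger. These readings, and all names in the code, are assumptions made for illustration.

```python
import math

K_BOLTZMANN = 1.380649e-16        # erg/K
DYN_PER_CM2_PER_MM_HG = 1333.22   # 1 mm Hg expressed in dyn/cm^2


def aperture_speed_l_per_s(area_cm2, temperature_k, molar_mass):
    """Equation (4): F = 3.638 A sqrt(T/M), liters/second for A in cm^2."""
    return 3.638 * area_cm2 * math.sqrt(temperature_k / molar_mass)


def leak_rate_mm_hg_per_s(atoms_per_min, volume_cm3, temperature_k=295.0):
    """Pressure rise produced in a closed volume by a steady input of atoms."""
    atoms_per_mm_hg = (DYN_PER_CM2_PER_MM_HG * volume_cm3
                       / (K_BOLTZMANN * temperature_k))
    return (atoms_per_min / 60.0) / atoms_per_mm_hg


def model_gauge_pressure(density_per_cm2, area_cm2, release_fraction=0.02,
                         u_volume_cm3=894.0, speed_l_per_s=13.8):
    """Equilibrium pressure in U when half the released atoms enter U."""
    n_total = density_per_cm2 * area_cm2
    atoms_into_u = 0.5 * release_fraction * n_total      # atoms per minute
    leak = leak_rate_mm_hg_per_s(atoms_into_u, u_volume_cm3)
    return (u_volume_cm3 / 1000.0) * leak / speed_l_per_s


if __name__ == "__main__":
    bore_area = math.pi * (2.2 / 2.0) ** 2
    print(aperture_speed_l_per_s(bore_area, 4.2, 4.0))
    print(model_gauge_pressure(8e13, 40.0))
```

Evaluated literally with T = 4.2° K, the bare formula gives about 14.2 liters/second, slightly above the 13.8 liters/second quoted, so the quoted figure is kept as the default pumping speed. The resulting pressure is close to 10⁻⁹ mm Hg, of the same order as the observed gauge pressure.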

(c) In this region above 10-° mm Hg the results represent true pressure 
measurements above the adsorbed layer. 


VI. DISCUSSION 


Figure 7 compares the present measurement with previous measurements
on the physical adsorption of helium at 4.2° K. The scales are those commonly
used in the adsorption literature. With the exception of Keesom and Schweers'
(1941b) measurements with preadsorbed gases on thuringian glass, which
are discussed below, the general similarity of Fig. 7 to Fig. 1 is plain. The
essential parameter which determines the location of the knee in Fig. 1 is the
magnitude of c which was obtained from the experimental heat of adsorption
measured by Aston, Mastrangelo, and Tykodi (1955) for TiO₂ at 17° K.
Thus the correlation between this heat of adsorption and the low temperature,
low pressure data appears to be experimentally established. It is not surprising
therefore that some of the conclusions which are drawn below from an
examination of Fig. 7, which gives results at one temperature only, have already been
drawn by Aston and his colleagues from their measurements of heats of
adsorption at much higher temperatures and pressures.

Fig. 7. Comparison of experimental results on the physical adsorption of helium at 4.2° K.
The adsorbent material follows the authors' names. The dotted portion of the present results
represents an upper bound on the pressures above the adsorbed layer. Van Itterbeek,
van Dingenen, and Borghs (1939) and also White, Chien, and Johnston (1952) reported
isotherms at 4.2° K on charcoal and silica gel respectively, but did not report the area of
their adsorbent.

No changes in slope which could be interpreted as decisive evidence for the
completion of the first adsorbed layer are seen, and it is proposed that no
authors to date, probably including the present one, have made true pressure
measurements above a partial monolayer of helium on a solid adsorbent for
T = 4.2° K. At temperatures below 4.2° K the first adsorbed layer is expected
to be complete at even lower saturations than at 4.2° K. It may be
noted that the results of Meyer (1956) on gold fit this conclusion without
essential difficulty, although there is a suggestion that the first layer is
complete at P/P₀ ≈ 10-°.

The present experiment gives the result that the first adsorbed layer is
complete at P/P₀ < 10⁻¹², thus c > 10¹². From (2) this places a lower bound
on the heat of adsorption in the first layer at 250 cal/mole. This is the highest
heat of adsorption yet quoted from measurements in the temperature range
T < 4.2° K, the previous high being 148 cal/mole (Stout and Giauque 1938).
However, heats of adsorption obtained near 17° K place the heat of adsorption
between 700 and 200 cal/mole in the first layer (Aston, Mastrangelo, and
Tykodi 1955; van Itterbeek and van Dingenen 1938). An evaluation of the
heat of adsorption on charcoal by Steele and Halsey (1954) from apparent
volume measurements gives 600 cal/mole. These results support the
conclusion that only at temperatures well above 4.2° K have pressure
measurements been successfully made above partial monolayers of helium.
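
The step from a bound on c to a bound on the heat of adsorption can be checked with a short sketch. It assumes that equation (2) has the usual B.E.T. form, in which c is the exponential of the difference between the first-layer heat of adsorption and the heat of vaporization, divided by RT. The heat of vaporization of helium is taken as roughly 20 cal/mole, a value that is not stated in this section. All names are illustrative.

```python
import math

R_CAL = 1.987          # gas constant, cal/(mole K)
E_VAP_HELIUM = 20.0    # assumed heat of vaporization of helium, cal/mole


def bet_c(e_first_layer, temperature_k, e_vap=E_VAP_HELIUM):
    """Assumed B.E.T. form: c = exp[(E1 - E_vap) / (R T)]."""
    return math.exp((e_first_layer - e_vap) / (R_CAL * temperature_k))


def heat_from_c(c, temperature_k, e_vap=E_VAP_HELIUM):
    """Invert the same relation: E1 = E_vap + R T ln c."""
    return e_vap + R_CAL * temperature_k * math.log(c)


if __name__ == "__main__":
    print(heat_from_c(1e12, 4.2))   # lower bound on E1 for c > 1e12
    print(bet_c(300.0, 4.2))        # c for the assumed E1 = 300 cal/mole
```

Under these assumptions, a bound of 10¹² on c at 4.2° K corresponds to a heat of adsorption close to 250 cal/mole, in line with the figure quoted above.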

From the results of Keesom and Schweers (1941b) shown in Fig. 7 it is
concluded that Ne and H₂ preadsorbed on thuringian glass reduce the heat of
adsorption for the first helium layer so that this layer is completed at P/P₀
> 10⁻⁵, while with preadsorbed O₂ the first layer is complete at P/P₀ < 10~’,
and with preadsorbed N₂ at P/P₀ < 10~°. It is interesting to present here
Table I which gives the relative permeabilities through glass and the molecular
diameters of these four gases and helium. Table I is taken from a paper by
Norton (1957). The interpretation usually placed on the correlation in Table I
is that diffusion takes place through holes in glass which are large enough to
pass the smaller molecules but not the larger. On the surface of the glass
these holes appear as sites (Tilton 1957) upon which physical adsorption
might be expected to occur preferentially, and it is interesting to find that
preadsorbed H₂, Ne, O₂, N₂ apparently lower the heat of adsorption of helium
on thuringian glass in the same order that they are arranged in Table I,
suggesting saturation of the adsorption sites by the preadsorbed gas. Steele
and Aston (1957) have examined the effect on the heat of adsorption of
helium on TiO₂ caused by preadsorption of argon, and have arrived at similar
conclusions.


TABLE I

Permeation of gases through silica glass

            Permeation K through      Atomic or molecular
Gas         silica glass at 700° C    diameter, Å
Helium      2.1×10⁻⁸                  1.95
Hydrogen    2.1×10⁻⁹                  2.5
Neon        4.2×10⁻¹⁰                 2.4
Oxygen      Under 10⁻¹⁵               3.2
Nitrogen    Under 10⁻¹⁵               3.4

K = cm³ gas (S.T.P.) per second per cm² area per mm thickness

It is desirable to obtain an accurate measure of vₘ from adsorption isotherms
in the vicinity of the completion of the first layer, both because vₘ
is an important quantity, and in order to check Steele's (1956) values of vₘ
obtained at higher saturations. The present measurements are quite inadequate
for the accuracy required because (a) all measurements were pressure
measurements which were not calibrated directly against an absolute manometer,
(b) the adsorbing area was not known exactly, (c) the scatter in the
observed results was serious, (d) it is possible that the first layer density was
determined by the density of adsorption sites rather than by interaction of
adsorbed helium atoms.

It is now clear that accurate measurements of vₘ for T < 4.2° K in the
vicinity of monolayer completion will require accurate pressure measurements
below 10⁻⁹ mm Hg.

If the general interpretation placed on the results in Section V is correct,
then an upper limit may be placed on the equilibrium pressure achievable
above the minimum quantity adsorbed in Fig. 5. It is obtained by drawing a
straight line horizontally from the point representing the minimum adsorbed
density (~10¹⁰ atoms/cm²) to the left until an intersection is made with the
line at 45°. The pressure is about 10⁻¹⁴ mm Hg. This minimum quantity was
determined by the helium diffusion into the system which was about an
order higher than is usual for pyrex systems (Hobson and Redhead 1958).
Further, the normal diffusion rate may be reduced greatly by cooling the
pyrex (Rogers, Buritz, and Alpert 1954) or by the use of materials other
than pyrex for the vessel walls. With care the sources of other gases may be
made small compared with the helium source even in a pyrex system, and
these other gases may be expected to be pumped by liquid helium traps more
effectively than helium itself. Thus the low pressure limitations of the present
system regarding helium do not appear to be fundamental.

Gomer and his co-workers (e.g., Gomer, Wortman, and Lundy 1957) have
used field emission microscopes with pyrex bulbs immersed in liquid helium.
It is of interest to apply the results of Fig. 5 to their case. Their bulbs were
approximately 100 cm³ and spherical, and soon after seal-off could be cooled
so that no further helium diffusion from the atmosphere took place. If it is
assumed that the time between seal-off and cooling was 5 minutes, and that
the helium leak rate was 10⁻¹⁰ mm Hg/minute (true helium pressure), then
a helium pressure of 5×10⁻¹⁰ mm Hg was present at the time of cooling. When
these atoms were adsorbed there were about 2×10⁷ atoms/cm² on the inside
surface of the bulb. By extrapolating Fig. 5 an upper limit on the helium
pressure present is 2×10⁻¹⁷ mm Hg. This corresponds to a gaseous density
of 5×10⁻²⁴ g/cm³, which may be compared with the mean density for
interstellar gas of 1×10⁻²⁴ g/cm³ quoted by Allen (1955). Thus pressures in the
laboratory approaching those of interstellar space do not appear unreasonable.
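
The estimate for the sealed-off bulb can be followed in a short sketch. It assumes the ideal gas law at 295° K for the gas present before cooling, a spherical bulb of 100 cm³, and a linear (45°) extrapolation of the adsorbed density through a reference point read as 8×10¹³ atoms/cm² at 1×10⁻¹⁰ mm Hg. That reference point is an assumed reading of the sample point used in Section V, and all names are illustrative.

```python
import math

ATOMS_PER_CM3_PER_MM_HG = 3.27e16   # ideal gas at 295 K (assumed temperature)


def sphere_area_cm2(volume_cm3):
    """Inside surface area of a sphere of the given volume."""
    radius = (3.0 * volume_cm3 / (4.0 * math.pi)) ** (1.0 / 3.0)
    return 4.0 * math.pi * radius ** 2


def surface_density(p_mm_hg, volume_cm3):
    """Atoms per cm^2 if all the gas in the bulb is adsorbed on its wall."""
    atoms = p_mm_hg * ATOMS_PER_CM3_PER_MM_HG * volume_cm3
    return atoms / sphere_area_cm2(volume_cm3)


def extrapolated_pressure_bound(density_per_cm2, ref_density=8e13,
                                ref_pressure_mm_hg=1e-10):
    """Upper bound on pressure from a line of unit slope on log-log axes."""
    return ref_pressure_mm_hg * density_per_cm2 / ref_density


if __name__ == "__main__":
    density = surface_density(5e-10, 100.0)
    print(density, extrapolated_pressure_bound(density))
```

With these assumptions the sketch gives a surface density of the order of 10⁷ atoms/cm² and a pressure bound of the order of 10⁻¹⁷ mm Hg.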


PROLONGED SIGNAL FADE-OUT ON A SHORT MICROWAVE
PATH


D. R. HAY AND G. E. POAPS


ABSTRACT 


During a period of 1 year, the incidence of signal fade-out has been observed 
in 2000 Mc/s transmissions over a 21-mile path near Ottawa. Fade-out durations 
varied from a few minutes to several hours, with the most frequent occurrence in 
summer and during the night. The cause of signal fade-out has been studied 
qualitatively through a ray analysis of the air refractivity profiles at the center of 
the radio path. This analysis indicates that fade-out is associated with a shallow, 
horizontal transition zone in air vapor pressure at a level near the antenna 
heights. Here, the vapor pressure changes through 1 millibar or more in a height 
interval of 100 ft. The signal fade-out is weak if the transition is from dry air 
below to moist air above, but is strong if the inverse transition occurs. Weak 
fade-out (a ‘radio hole’) is due to ray divergence, and strong fade-out (a ‘radio 
antihole’) is due to ray interference at the receiver. 


INTRODUCTION 

The infrequent disappearance of signal in transmission over a line-of-sight 
path is an accepted characteristic of microwave propagation. Normally, the 
signal is relatively steady; variation in amplitude is less than a few decibels 
about the mean level. The signal does, however, decrease by many decibels 
below the mean level on some occasions, even when atmospheric absorption 
is negligible. At such times, the signal may remain weak for several hours 
before returning to the normal level. These depressions in signal transmission 
are called signal fade-out, when the level of the signal decreases by more than 
five decibels from the mean. The term ‘prolonged signal fade-out’ is applied 
to this characteristic of the signal to distinguish it from the more rapid fading 
which occurs continually in transmissions over much longer ranges. 

The cause of prolonged signal fade-out has not been understood, although 
this characteristic of signal transmission has been observed for several years. 
During the period 1939-45, many records of signal fade-out having durations 
of a few minutes were obtained on overwater and overland paths (Kerr 1951, 
Chapter 4). Prolonged signal fade-out was reported at a later date by Bean 
(1954) in the United States, and by others in Australia and New Zealand 
(Voge 1953). It is the practical difficulty of examining the radio path at the 
appropriate time that has delayed an extensive study of the problem. 

This paper presents a qualitative study of microwave signal fade-out as
carried out in eastern Ontario. Here, particular attention will be paid to the
physical properties of the air near the earth's surface which have favored this
weakening in signal transmission.

STATISTICS ON FADE-OUT

The incidence of fade-out was studied in signal transmissions over a 20.7- 
mile path near Ottawa. For this purpose, a 2000 Mc/s signal was directed from 
Franktown towards Hazeldean between 8-ft paraboloidal antennas. These 
antennas were mounted over 200 ft above ground to ensure a clear intervening 
path. Recording of the received signal power was continued from October 
1956 until October 1957. 

Two distinctive types of signal fade-out were found in the microwave
transmissions. Sample records that illustrate these two types are shown in
Fig. 1. The normal signal is relatively steady at the zero-decibel level on each
record. Figure 1(a) indicates a weak fade-out, in which the signal amplitude
decreased as much as 20 db below the mean level. It will be noted that changes
in signal amplitude are gradual. The second type, a strong signal fade-out, is
shown in Fig. 1(c). Here the signal depression is more abrupt, and the minimum
level is more than 30 db below the mean. The signal just before and after
the fade-out is above the mean level. Figure 1(b) illustrates a signal fade-out
that appears to be a combination of these two types.

Fig. 1. Records of signal amplitude indicating types of fade-out.

The incidence of fade-out in the Ottawa area followed generally the pattern 
which has been reported for the central United States (Bean 1954). Figure 2 
indicates the marked seasonal tendency. It will be seen that the signal was in 
fade-out for more than 24% of the time in summer, but for less than 5% of 
the time during the winter. The diurnal variation in fade-out incidence is 
shown in Fig. 3, where maximum frequency is indicated for the hours between 
midnight and 0600 hours. No fade-out has been observed on the Ottawa path 
between 1000 and 1600 hours. The onset of fade-out after sunset was spread 
over a longer interval than was the disappearance after sunrise. 

Fig. 2. Seasonal variation in signal fade-out (1957).
Fig. 3. Diurnal variation in incidence of signal fade-out (total for 1957).


Further analysis of fade-out on the Ottawa path has shown that the average
duration was between 8 and 9 hours. The duration was generally independent
of the maximum depth of the fade-out, and of the time of year. The weak type
of fade-out in Fig. 1(a) was found throughout the year, but strong fade-out
(Fig. 1(c)) was entirely absent in winter. The longest period during which
the signal depression exceeded 30 db below the mean level was 6 minutes;
such depressions occurred for less than 0.3% of the time in summer.

The dependence of signal fade-out upon locality was apparent from a 
similar study on two separated radio paths. The second path was 31 miles 
in length, and was located about 25 miles north of North Bay. This experi- 
mental link was 200 miles from the Franktown—Hazeldean link. It was found 
that over a period of 1 year, the seasonal and diurnal trends in fade-out in- 
cidence were similar on the two paths. However, a detailed examination showed 
that there was little correlation between the incidence of individual fade-outs 
at Hazeldean and at North Bay. 


REFRACTIVITY PROFILES 

Special soundings of the temperature and humidity profiles through the 
air at the radio path were made during August and September of 1957. The 
site for these measurements was located midway between the transmitter 
and receiver. Several of these soundings were carried out while the microwave 
transmission was depressed by fade-out. 

Conventional radiosonde equipment was modified for these special
measurements. Two radiosondes were combined into one instrument so that together
they would indicate air temperature and humidity continuously.³ Although
the slow response of the sensing elements in the radiosonde did not permit this
type of instrument to indicate the fine structure in the air profiles, the
radiosonde nevertheless was capable of sensing the principal features through
transition layers in the air. The soundings at the center of the
Franktown-Hazeldean path were made by controlled ascent and descent of the radiosonde,
at a rate which permitted the sensing elements to approach equilibrium with
the sampled air.

Some notable features were apparent in the temperature and dewpoint 
profiles of the air at the time of signal fade-out. In all, 27 soundings were 
made during August and September; seven of these were associated with 
prominent signal depressions at the Hazeldean receiver. Examination of these 
profiles showed that the signal transmission between Franktown and Hazeldean 
was normal when the air was well mixed and the lapse rates of temperature 
and humidity were approximately uniform through the lowest 1000 ft of the 
air. However, a prominent departure from the uniform lapse rates, at an 
altitude near the heights of the transmitting and receiving antennas, was 
observed whenever the microwave signal at Hazeldean was in fade-out. 

Refractivity profiles have been derived from the temperature and dew- 
point soundings, for a ray analysis of the radio path. The air pressure profiles, 
which are required also in computing refractivity, have been obtained from 
conventional radiosonde measurements at a nearby weather station. For 
the purpose of this ray analysis, curvature of the earth’s surface and of the 
layer of air above it is simulated by converting the refractivity profile of the 
air into a modified index profile (M-profile), and by taking the mean sea 
level as a flat surface (Kerr 1951, Section 2.4). The three M-profiles in Fig. 4 
illustrate the main features of all of the profiles derived from the temperature 
and dewpoint soundings during signal fade-out. These profiles apply to the 
radio path at the times of the three signal recordings illustrated in Fig. 1. 

³The pressure-operated switches in two Bendix-Friez radiosondes (model 279) were 
removed, and one radiosonde was set to measure temperature only while the other indicated 
only the humidity of the air. The height of the radiosonde was determined to an accuracy of 
3% through the combined measurements of an optical inclinometer on the ground and a 
gauge that measured the length of the tethering cord (Anderson 1947). 
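
The conversion from a sounding to an M-profile described above can be sketched as follows. This is only an illustration under stated assumptions, since the formulas actually used are not given in the text: the Smith-Weintraub refractivity relation, the Magnus approximation for vapor pressure from dewpoint, an earth radius of 6371 km, and all function names are choices made for the example.

```python
import math

EARTH_RADIUS_M = 6.371e6   # assumed mean earth radius, metres
FT_TO_M = 0.3048           # feet to metres


def vapor_pressure_mb(dewpoint_c):
    """Vapor pressure (mb) from dewpoint, Magnus approximation (assumed)."""
    return 6.112 * math.exp(17.62 * dewpoint_c / (243.12 + dewpoint_c))


def refractivity(pressure_mb, temp_c, dewpoint_c):
    """Radio refractivity N = (n - 1) * 1e6, Smith-Weintraub relation (assumed)."""
    temp_k = temp_c + 273.15
    e_mb = vapor_pressure_mb(dewpoint_c)
    return 77.6 * pressure_mb / temp_k + 3.73e5 * e_mb / temp_k ** 2


def modified_index(n_units, height_ft):
    """Modified index M = N + (h / a) * 1e6 for height h above the flat reference."""
    return n_units + (height_ft * FT_TO_M / EARTH_RADIUS_M) * 1e6


if __name__ == "__main__":
    n_surface = refractivity(1013.25, 15.0, 10.0)
    print(round(n_surface, 1), round(modified_index(n_surface, 1000.0), 1))
```

For a pressure of 1013 mb, a temperature of 15 °C, and a dewpoint of 10 °C the sketch gives a refractivity of roughly 330 units, and 1000 ft of height adds about 48 M-units. Near these temperatures a change of 1 mb in vapor pressure shifts the refractivity by about 4.5 units, which is why shallow moisture transitions stand out in the M-profile.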


RADIO RAY PATHS 

The radio ray paths in the vertical plane between the Franktown trans- 
mitter and the Hazeldean receiver have been examined for an explanation of 
the observed signal fade-out. In this analysis it has been assumed that the 
physical structure of the air is spherically symmetrical and parallel to the 
mean sea level. Hence the vertical M-profile should be the same everywhere 
between the transmitter and receiver. 

The diagrams on the right in Figs. 4(a), 4(b), and 4(c) represent the ray 
paths for the indicated M-profiles. Here, the rays have been constructed by 
the methods of geometrical optics (Kerr 1951, Section 2.3), since significant 
changes in refractivity take place over distances that are large compared 
with the radio wavelength.⁴ It will be noted that the rays diverge from the 
transmitter with equal increments in angle of elevation and that the rays 
illustrated here occupy only a fraction of the antenna beamwidth. Then the 
signal field intensity is uniformly divided between the rays near the trans- 
mitter. Signal reflection from the ground has been omitted in these diagrams 
because the reflection coefficient may be assumed small at microwave frequen- 
cies (Bullington 1954). 

Examination of the ray diagrams in Fig. 4 shows that the prominent 
irregularity in the M-profile near the antenna heights is the cause of a disturbance 
in an otherwise regular pattern. Two distinct effects of this irregularity 
upon the ray path are illustrated here. In the first example, Fig. 4(a) shows 
that a region of weak field intensity has been created at the receiver by the 
divergence of neighboring rays. The second effect is indicated in Fig. 4(c), in 
which the irregularity has caused a bundle of rays to be diverted so that ray 
interference occurs at the receiver antenna. Figure 4(b) illustrates a combination 
of these two patterns. Elsewhere, the region of weak field strength 
(Fig. 4(a)) is termed a ‘radio hole’, and the region of ray convergence in Figs. 
4(b) and 4(c) a ‘radio antihole’ (Wong 1958). 

The abrupt irregularity in the M-profile of Fig. 4(c) suggests the possibility 
of partial reflection of the incident microwave signal. Northover (1955) has 
found that elevated layer discontinuities in the air act as Fresnel reflecting 
surfaces at meter wavelengths; the reflection at centimeter wavelengths is 
generally weaker although still appreciable. Since partial reflections are 
ignored in the preceding ray analysis, they will be considered here as a possible 
cause of signal fade-out. The reflection coefficient of the discontinuity will be 
computed and specular reflection assumed to occur at the layer. 

⁴An extensive analysis of ray paths through numerous refractivity profiles in the United 
States has been carried out for ground-to-air and air-to-air transmissions, with the use of an 
analogue computer. A description of the technique and of the results has been reported recently 
by Wong (1958). 

A simple calculation shows that partial reflection from an irregularity which 
is even more abrupt than that in the example of Fig. 4(c) cannot explain 
strong signal fade-out. If the M-profile in this figure contained the change 
from 333 to 400 M-units entirely at a height of 230 ft above the transmitter 
antenna, and if the refractivity is uniform above and below the boundary, the 
reflection coefficient of the boundary would be −0.43.⁵ Interference between 
the reflected signal and the direct signal from the transmitter would cause a 
total change in the received signal of less than eight decibels. Indeed, a reflect- 
ing surface of this type would have to lie almost along the line joining the 
transmitter to the receiver in order to reflect sufficient signal for a strong 
fade-out. 
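
The arithmetic behind this estimate can be sketched as follows. The example is a minimal check, not part of the original analysis: the function names are hypothetical, the two numerical terms are those quoted in the footnote on the reflection coefficient, and the reflected ray is assumed to add to the direct ray either exactly in phase or exactly out of phase.

```python
import math


def grazing_reflection_coefficient(sin_term, root_term):
    """Fresnel coefficient (sin(psi) - root) / (sin(psi) + root) at grazing incidence."""
    return (sin_term - root_term) / (sin_term + root_term)


def interference_range_db(gamma):
    """Signal extremes in db, relative to the direct ray alone, when a reflected
    ray of relative amplitude |gamma| adds in phase and in antiphase."""
    magnitude = abs(gamma)
    peak_db = 20.0 * math.log10(1.0 + magnitude)
    null_db = 20.0 * math.log10(1.0 - magnitude)
    return peak_db, null_db, peak_db - null_db


if __name__ == "__main__":
    # sin(psi) = 5e-3 and the square-root term = 1.27e-2, as quoted in the footnote
    gamma = grazing_reflection_coefficient(5e-3, 1.27e-2)
    peak_db, null_db, total_db = interference_range_db(-0.43)
    print(round(gamma, 3), round(peak_db, 2), round(null_db, 2), round(total_db, 2))
```

With the quoted coefficient of −0.43 the total swing between reinforcement and cancellation is about 8 db, and the deepest depression is only about 5 db below the direct signal, far short of the 30 db observed in strong fade-out.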

From the above considerations, it appears that signal fade-out results from 
a disturbance of the radio transmission path by a prominent irregularity in 
the air refractivity. A ray analysis similar to those illustrated here has been 
carried out on all of the M-profiles associated with signal fade-out. The results 
indicate that weak signal fade-out (depression less than 20 db) is caused by 
simple weakening of the field at the receiver through ray divergence. Strong 
fade-out (depression greater than 30 db), however, requires the more complex 
multiple ray interference. The latter explanation is in accord with studies 
reported elsewhere on the synthesis of a deeply fading signal (Crawford and 
Jakes 1952; DeLange 1952; Kaylor 1953). 

A note should be added here on the apparent inaccuracy of the ray diagrams. 
There appears to be no reasonable alternative to the explanation given above 
for signal fade-out, as based upon the ray analysis of the M-profiles. And yet, 
the receiver antenna generally is not in a region where fade-out would be 
predicted by the ray diagram. This departure must be attributed in part to 
the fact that the thin transition layers which lead to fade-out are rarely 
stationary, and to the difficulty in measuring temperature and dewpoint at 
moving layers with the modified radiosonde.⁶ Inspection of the diagrams in 
Fig. 4 shows that the profile measurements at critical heights did not 
coincide with the time of maximum fade-out, and consequently the direction 
of the ray focusing would be altered during the sounding. 


⁵From Stratton (1941), Section 9.5, the reflection coefficient ($\Gamma$) in which dielectric constant 
is replaced by refractive index ($n$) is 

$$\Gamma = \frac{\sin\psi - \left[\left(\dfrac{n_2}{n_1}\right)^{2} - \cos^{2}\psi\right]^{1/2}}{\sin\psi + \left[\left(\dfrac{n_2}{n_1}\right)^{2} - \cos^{2}\psi\right]^{1/2}} = \frac{5\times10^{-3} - 1.27\times10^{-2}}{5\times10^{-3} + 1.27\times10^{-2}}$$

where $\psi$ = the grazing angle of reflection = 5 milliradians, and subscripts 1 and 2 refer to the 
media containing the reflected wave and the transmitted wave respectively. This equation 
applies to vertically or horizontally polarized waves in air. 

⁶Soundings for each profile with the modified radiosonde required a total time of 10 to 45 
minutes. The numbered arrows in Fig. 1 and at the M-profile of Fig. 4 indicate the position 
of the radiosonde during the signal recording interval. 


CLIMATOLOGY 


Information on the layer structure that leads to signal fade-out has been 
obtained from the radiosonde measurements. An inspection of the ray diagrams 
shows that changes in the amplitude of the radio signal at the receiver will be 
small unless a prominent departure from the mean occurs in the M-profile. 
It is necessary, further, that the departure appears near the line between the 
transmitter and receiver. Irregularities in refractivity at other heights have 
only a minor effect upon the ray diagram. The formation of a ‘radio hole’ 
(shallow fade-out in Fig. 4(a)) at the receiver accompanies a decrease in the 
slope of the M-profile over a shallow height interval, while a ‘radio antihole’ 
(deep fade-out in Figs. 4(b), 4(c)) is related to an inversion of the M-profile, 
also through a shallow height interval. All of the departures from mean 
refractivity that contributed to signal fade-out were less than 100 ft in 
thickness. 

The temperature and humidity soundings indicate further that each of the 
significant departures in refractivity was due to a transition from dry to 
moist air through the shallow height interval. These transitions consisted of 
changes in the air vapor pressure of about one millibar or greater. Moist air 
is below the drier air for inversions in the M-profile, but drier air is below 
the moist air for a decrease in the slope of the M-profile. The example of Fig. 
4(c) indicates a change in vapor pressure of 21 millibars at the lower boundary 
of the elevated irregularity, and a more gradual transition of the same order 
in the reverse direction throughout the rest of the irregularity. 

A system of stationary air pressure over the transmission path is a general 
requirement for microwave signal fade-out. During the 12 months of the 
experiment, 64% of the fade-outs were simultaneous with a center of high 
pressure and 30% with the slowly changing transition interval between 
centers of high and low pressure. The remaining 6% followed shortly after a 
center of low pressure had moved through the radio path. 

The motion of the shallow transition layers associated with signal fade- 
out should be commented upon. The radiosonde measurements of September 
19 which yielded the M-profile in Fig. 4(b) also showed that the prominent 
irregularity near the height of the antenna appeared to descend through a 
vertical interval of 100 ft in 1 hour. During this time, only a minor change 
occurred in the refractivity profile through this irregularity. The apparent 
vertical motion may be due either to horizontal movement of an inclined 
layer or to the descending of a horizontal layer. In either case, the effect of 
the layer upon microwave signal transmission would be essentially the same, 
since the ray focusing is present even when the layer is inclined so that it is 
parallel to the mean slope of the land. 


REMARKS 


The ray analysis presented here indicates that microwave signal fade-out 
is associated with a shallow layer of transition in air vapor pressure. All of 
the observed transitions occurred through a height interval of the order of 
100 ft; the change in vapor pressure was 1 millibar or greater. Further, these 
transitions were found within several hundred feet of the ground and near the 
height of the antennas. It is interesting to note that the condition for shallow 
signal fade-out, viz. a transition from dry air below to moist air above, is 
found at the latitude of Ottawa throughout the year. However, the radio 
transmission experiment suggests that the inverse transition for deep signal 
fade-out is absent during colder weather. Such shallow transitions in vapor 
pressure apparently depend upon vertical stability of the air, since they are 
formed generally at night in a system of stable air pressure, and they are 
dispersed rapidly as the sun heats the earth's surface in the morning. 


MULTIPLE NEUTRON CAPTURE IN THE MIKE FUSION EXPLOSION¹ 

A. G. W. CAMERON 


ABSTRACT 


The constant ratio of yields of successive even or odd mass number nuclides 
produced in the Mike fusion explosion of 1952 implies an approximate constancy 
of the corresponding neutron capture cross sections for neutrons near 100 kev 
for the nuclei between U²³⁹ and U²⁵⁴. A discussion is given of the physical quantities 
on which the neutron capture cross sections depend, particularly radiation 
widths, nuclear level spacings, and neutron binding energies. It is concluded 
that conventional mass formulas are probably wrong in predicting a large de- 
crease in neutron binding energy as one goes from U** to U4, It is possible that 
the addition of a direct symmetry term to the mass formula might repair this 
difficulty. Another possibility is that an extra diffuseness of the nuclear surface 
in neutron-rich nuclei would reduce the Coulomb energy of the nucleus, thus 
stabilizing neutron binding energies. 


YIELDS FROM THE MIKE EXPLOSION 

In November, 1952, the United States Atomic Energy Commission con- 
ducted Operation Mike on an island in the Pacific Ocean. A refrigerated fusion 
device was exploded. The large neutron flux released in the explosion 
irradiated a mass of U²³⁸. Many multiple neutron capture events occurred, 
forming heavy uranium isotopes which later decayed by beta emission, leading 
to the discovery of the elements Einsteinium and Fermium (Ghiorso, Thomp- 
son, Higgins, Seaborg, Studier, Fields, Fried, Diamond, Mech, Pyle, Huizenga, 
Hirsch, Manning, Browne, Smith, and Spence 1955). A partial list of isotope 
yields from this test has been deduced by the writer from the literature 
(Huizenga and Diamond 1957); these are plotted in Fig. 1. It may be seen 
that the yields of even mass numbers are an exponentially decreasing function 
of the mass number; the yields of odd mass numbers follow a similar relation 
but are depressed relative to the curve for even mass numbers by a factor 2. 
This reflects the usual odd-even variation of average neutron capture cross 
sections. Disregarding this odd-even effect, the yield drops by a factor of 
5.8 per mass number. It has recently been revealed (Seaborg 1958) that 
the yields also follow this relation back to the vicinity of mass number 239. 

FIG. 1. Yields of nuclides of various mass numbers, produced in the Mike fusion explosion, 
deduced from abundances reported and interpolated by Huizenga and Diamond (1957). 
(Axes: relative yields against mass number.) 

¹Manuscript received December 4, 1958. 
Contribution from Atomic Energy of Canada Limited, Chalk River, Ontario. 
Issued as A.E.C.L. No. 760. 

The following is a speculative description of the operation of this fusion 
device. We may suppose that the basic essentials of the device consisted of a 
uranium or plutonium fission bomb, surrounded by a liquid deuterium-tritium 
mixture, surrounded in turn by a mantle of natural uranium. The explosion 
of the central fission trigger raised the central temperature into the range 
5×10⁷ °K to 10⁸ °K (Bethe 1958). The heavy uranium mantle held the D-T 
mixture in place while it was heated by various energy transfer mechanisms. 
Gryzinski (1958) has suggested that a nonthermonuclear fusion chain reaction 
will take place in a D-T mixture when the temperature exceeds 10⁷ °K. 
However, let us be conservative and neglect the possibility of this process 
taking place. 

The D-T fusion reaction releases 3.6 Mev in the form of kinetic energy of 
the He⁴ nucleus and produces a 14.7 Mev neutron. Again being conservative, 
let us assume that only the former amount of energy is available for heating 
the plasma. This energy is divided between kinetic energy of the particles and 
radiation energy density. At normal liquid hydrogen densities the plasma 
temperature can rise to 6×10⁷ °K; if the D-T mixture should be compressed 
to 10 times its normal density the temperature can rise to 10⁸ °K. In the 
normal density case the mean reaction time will be about 1 microsecond (Post 
1956), while in the compressed case it will be about 0.025 microsecond. It 
seems likely that some compression will have occurred since the D-T mixture 
was squeezed between the uranium mantle and the expanding gas from the 
fission explosion. 

The energetic neutrons will have interacted for the most part with the 
nuclei of the mantle, producing fission and (n, 2n) and (n, 3n) reactions. 
The fission in the mantle produced most of the energy release in the explosion. 
However, for present considerations the important effect is that each 
14-Mev neutron will have produced about three secondary neutrons mostly 
in the 1- to 2-Mev energy range. Inelastic scattering quickly reduced the 
energy of these neutrons to the vicinity of 100 kev (Vasilyev, Zamyartnin, 
Toropov, and Fomushkin 1957). Following several elastic collisions, these 
neutrons would be captured by the surrounding nuclei, provided they did not 
escape from the system. The U²³⁸ neutron capture cross section near 100 kev 
is about 0.2×10⁻²⁴ cm² (Hughes and Schwartz 1957). Hence the 100-kev 
neutrons will have been captured after a mean life of about 0.25 microsecond 
in uranium of normal density. Since the uranium is likely to be compressed 
somewhat less than the D-T mixture, it may be seen that the mean D-T 
reaction time is comparable to the mean lifetime of the secondary neutrons. 
Hence the later stages of the capture will take place after the production of 
14-Mev neutrons has nearly ceased. Since most of the production of the 
heaviest uranium isotopes by multiple neutron capture takes place in the 
very late stages of the capture process, it is reasonable to neglect any interference 
with this capture process by the declining flux of 14-Mev neutrons. 
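
The mean life quoted above follows from the capture rate per neutron, which is the product of the number density of nuclei, the cross section, and the neutron speed. A minimal check is sketched below; the uranium density of 19.05 g/cm³, the physical constants, the non-relativistic speed, and the function names are assumptions of the example rather than values taken from the text.

```python
import math

NEUTRON_MASS_MEV = 939.565   # neutron rest energy, Mev
LIGHT_SPEED_CM_S = 2.998e10  # speed of light, cm/s
AVOGADRO = 6.022e23          # nuclei per mole


def neutron_speed_cm_s(energy_mev):
    """Non-relativistic neutron speed for a given kinetic energy."""
    return LIGHT_SPEED_CM_S * math.sqrt(2.0 * energy_mev / NEUTRON_MASS_MEV)


def capture_mean_life_s(sigma_cm2, density_g_cm3, mass_number, energy_mev):
    """Mean life 1 / (n * sigma * v) of a neutron against capture."""
    nuclei_per_cm3 = density_g_cm3 * AVOGADRO / mass_number
    return 1.0 / (nuclei_per_cm3 * sigma_cm2 * neutron_speed_cm_s(energy_mev))


if __name__ == "__main__":
    # 0.2e-24 cm^2 at 100 kev in uranium of normal density (19.05 g/cm^3 assumed)
    tau = capture_mean_life_s(0.2e-24, 19.05, 238, 0.1)
    print(round(tau * 1e6, 2), "microsecond")
```

Under these assumptions the mean life comes out near 0.24 microsecond, in line with the figure of about 0.25 microsecond given in the text.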

It appears that subsequent fusion explosions have not involved refrigerated 
devices, and it has been suggested that in these devices the deuterium-tritium 
mixture has been replaced by a Li⁶-D mixture (or compound). The 
operation of this device would be similar to that described above except that 
the neutrons which are produced are necessary for the propagation of the 
explosion. These neutrons induce Li⁶(n, α)H³ reactions, and the tritium so 
produced can then fuse with the deuterium. Because of this additional neutron 
destruction process, the neutron flux available for capture by heavy nuclei 
will be considerably reduced. As will be seen, multiple neutron capture yields 
are proportional to a high power of the neutron flux, and hence the yields will 
be very much reduced in this type of device. If these considerations are 
correct, it appears that the Mike explosion has been a unique and important 
scientific experiment. 

If a mass consisting of some given species of nucleus is irradiated with a 
neutron flux $\phi$ for a time $t$ which is small enough so that the depletion of the 
number of original nuclei can be neglected, then the following ratio is established 
between the abundances of the $n$ and $n-1$ members of the multiple 
neutron capture product chain: 


(1)    $N_n : N_{n-1} = \sigma_{n-1}\phi t / n,$ 


where $\sigma_{n-1}$ is the appropriate average neutron capture cross section of the 
$n-1$ product nucleus. Since $\phi t$ is the integrated neutron flux, it is apparent 
that this result remains unchanged if more complicated temporal variations 
of the neutron flux are considered. 
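
Equation (1) can be checked numerically by stepping the capture chain forward in time with depletion neglected, so that each species only gains nuclei from the one below it. The sketch below is an illustration under assumed values: the cross sections, flux, and exposure time are arbitrary numbers chosen to give $\sigma\phi t = 0.2$, and the function name is hypothetical.

```python
def capture_chain(sigmas, flux, time, steps=20000):
    """Euler integration of dN_k/dt = sigma_(k-1) * flux * N_(k-1).

    Depletion of every species is neglected, as assumed for equation (1).
    Returns abundances [N_0, N_1, ...] relative to N_0 = 1.
    """
    abundances = [1.0] + [0.0] * len(sigmas)
    dt = time / steps
    for _ in range(steps):
        # compute all gains from the current state before applying them
        gains = [sigmas[k] * flux * abundances[k] * dt for k in range(len(sigmas))]
        for k, gain in enumerate(gains):
            abundances[k + 1] += gain
    return abundances


if __name__ == "__main__":
    sigmas = [0.2e-24] * 4               # cm^2, equal cross sections (assumed)
    chain = capture_chain(sigmas, 1e30, 1e-6)   # integrated flux 1e24 per cm^2
    for n in range(1, 5):
        print(n, chain[n] / chain[n - 1], 0.2 / n)
```

The printed ratios agree with $\sigma_{n-1}\phi t/n$ to within the step error of the integration, which shows the $1/n$ decline that the cross sections would have to offset if the yield ratios are to stay constant.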

If this were the situation in the Mike explosion, it may be noted that the 
product ratios could remain constant only if the average neutron capture 
cross sections near 100 kev increase by an order of magnitude as one goes 
from U²³⁸ to U²⁵⁴. However, there are two modifying effects which must be 
considered. 

The first effect is the modification introduced by exposing a mass of U²³⁸ 
to an integrated neutron flux which varies from one part to another. We may 
illustrate the effect of this by assuming an infinite mass of U²³⁸ exposed to a 
flux $\phi t = ar^{-4}$ where $a$ is a constant and the flux is cut off below $r = r_1$. Then 
the factor $\phi t/n$ of equation (1) is replaced by $a(4n-7)/nr_1^{4}(4n-3)$. This is 
just the maximum integrated flux $a/r_1^{4}$ multiplied by the ratio $(4n-7)/(4n-3)n$. 
The latter ratio tends to $1/n$ at large values of $n$ but varies much 
more rapidly for small values of $n$. If this were the only modifying effect we 
would require the cross sections to increase by an additional order of magnitude 
between U²³⁸ and U²⁵⁴ in order to give equal abundance ratios for successive 
capture products. Nearly all of this additional increase would be in the 
vicinity of U²³⁸. 
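
The replacement factor quoted above can be recovered by integrating $(\phi t)^n/n!$ over the volume outside $r_1$ and taking the ratio of successive members. The sketch below does this in closed form and compares the result with the expression in the text; cross sections are set to unity, the values of $a$ and $r_1$ are arbitrary, and the function names are hypothetical.

```python
import math


def volume_yield(n, a, r1, power=4):
    """Integral over r >= r1 of (a * r**-power)**n / n! * 4*pi*r**2 dr.

    Converges for power * n > 3, i.e. n >= 1 when power = 4.
    """
    exponent = power * n - 3
    return 4.0 * math.pi * a ** n * r1 ** (-exponent) / (exponent * math.factorial(n))


def integrated_factor(n, a, r1):
    """Ratio of successive volume-integrated yields (valid for n >= 2)."""
    return volume_yield(n, a, r1) / volume_yield(n - 1, a, r1)


def quoted_factor(n, a, r1):
    """Expression given in the text: a(4n - 7) / [n r1^4 (4n - 3)]."""
    return a * (4 * n - 7) / (n * r1 ** 4 * (4 * n - 3))


if __name__ == "__main__":
    for n in (2, 4, 8, 16):
        print(n, integrated_factor(n, 2.0, 1.5), quoted_factor(n, 2.0, 1.5))
```

The two columns agree, and the ratio $(4n-7)/(4n-3)$ rises from 0.2 at $n = 2$ to about 0.93 at $n = 16$, so most of the variation is indeed concentrated at small $n$.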













The second modifying effect is depletion of the source nuclei. From the 
above discussion we may see that the heaviest nuclei have been predominantly 
produced in the region of highest flux. Equation (1) may be crudely interpreted 
as indicating that the $n-1$ species has been present in the neutron 
flux for a time $t/n$. Neutron capture in this species has depleted it to a value 
$\exp(-\sigma_{n-1}\phi t/n) = 0.8$, where the numerical value is taken from Fig. 1 neglecting 
the odd-even effect. Hence $\sigma_{n-1}\phi t/n = 0.23$, for $n \sim 12$. On this basis 
the source nuclei in the region of highest flux have been depleted to a value 
$\sim \exp(-2.8\,\sigma_0/\sigma_{n-1})$. It may be seen that for $\sigma_0 = \sigma_{n-1}$ the source nuclei are 
depleted to about 6% of their initial abundance. The greatest depletions are 
in the vicinity of U²³⁸. This effect compensates for the previously supposed 
necessity of increasing the capture cross sections in the heavier uranium 
isotopes. 
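
The numbers in this estimate can be retraced in a few lines. The sketch below simply evaluates the two exponentials; the function names are hypothetical, and the inputs 0.23 and $n = 12$ are the values quoted in the text.

```python
import math


def intermediate_survival(capture_ratio):
    """Fraction of the (n-1) species left after a time t/n in the flux."""
    return math.exp(-capture_ratio)


def source_survival(capture_ratio, n, sigma0_over_sigma=1.0):
    """Fraction of source nuclei left after the full time t in the flux.

    The exponent is n * (sigma_(n-1) * phi * t / n) * (sigma_0 / sigma_(n-1)).
    """
    return math.exp(-capture_ratio * n * sigma0_over_sigma)


if __name__ == "__main__":
    print(round(intermediate_survival(0.23), 3))   # close to the 0.8 read from Fig. 1
    print(round(0.23 * 12, 2))                     # exponent of about 2.8
    print(round(source_survival(0.23, 12), 3))     # about 6% of the source remains
```

With equal cross sections the source nuclei in the region of highest flux fall to roughly 6% of their initial abundance, as stated; a smaller $\sigma_0/\sigma_{n-1}$ would leave correspondingly more of the source.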

A complete evaluation of these effects requires a knowledge of the inte- 
grated flux distribution in the Mike device. Even in the absence of this know- 
ledge, however, some valuable conclusions can be drawn. The abundance 
ratio of equation (1) is equal to a cross section multiplied by a factor which 
can vary rapidly only near U²³⁸ due to any of the effects considered. Hence 
it is possible to conclude that the average capture cross section for 100-kev 
neutrons is nearly constant among the heavier uranium nuclei, say from 
U2 to U*. This is very surprising, since, as the following discussion will 
show, one would expect the capture cross section to decrease steadily as the 
mass number of the uranium nucleus increases and its neutron binding energy 
decreases. 

In a first attempt to understand this effect, based only on the data of Fig. 1, 
it may be supposed that there is a significant drop in neutron binding energy 
beyond U²⁴⁴, which has 152 neutrons. There is a gap in the Nilsson diagram 
for deformed nuclei at this point (Mottelson and Nilsson 1958) which shows 
this effect among heavier nuclei on the valley of beta stability. If this were 
the case, it is conceivable that the neutron binding energy might be nearly 
constant for a considerable distance beyond U²⁴⁴, leading to approximately 
constant neutron capture cross sections. But this explanation would require a 
sharp kink downwards in the yield curve as one goes beyond U²⁴⁴, and this 
kink was not observed (Seaborg 1958). Hence it is very unlikely that there 
is an appreciable discontinuity of binding energy at U²⁴⁴. 

Since there is thus no known reason why the neutron binding energy and 
nuclear level density properties of the uranium nuclei should change abruptly 
immediately beyond U²⁴⁴, it is a reasonable conclusion that the average neutron 
capture cross section at 100 kev is approximately constant from U** to U**, 
This will be assumed in the following discussion, but it should be noted that 
this discussion would not be changed in its essentials if the approximate 
constancy were restricted to the more certain range U*" to U4, 

The remaining part of the paper contains a discussion of the physics of 
neutron capture cross sections and a subsequent interpretation of the un- 
expected results from the Mike test. 
NEUTRON CAPTURE CROSS SECTIONS 


The neutron capture cross section averaged over resonances is (Lane and 
Lynn 1957): 


(2)    $\langle\sigma\rangle = \sum_{J,l} \langle\sigma_{Jl}\rangle,$ 


where 

(3)    $\langle\sigma_{Jl}\rangle = \dfrac{(2J+1)}{(2I+1)}\,\dfrac{\pi^{2}}{k^{2}}\,\dfrac{1}{\langle D_J\rangle}\left\langle \sum_{j} \dfrac{\Gamma_{\lambda J(nlj)}\,\Gamma_{\lambda J(\gamma)}}{\Gamma_{\lambda J}} \right\rangle_{\lambda}$ 

and 

(4)    $\Gamma_{\lambda J} = \sum_{l,j} \Gamma_{\lambda J(nlj)} + \Gamma_{\lambda J(\gamma)}.$ 


Here $\lambda$ refers to the various resonances over which the average is taken, each 
of spin $J$, $l$ is the angular momentum of the incoming neutron, $k$ is $2\pi$ times 
the wave number of the incident neutron, $I$ is the spin of the ground state of 
the target nucleus, $\Gamma_{\lambda J(\gamma)}$ is the radiation width of the resonance $\lambda$, $\Gamma_{\lambda J(nlj)}$ 
is the entrance channel neutron width for channel spin $j$ (which may take 
the values $j = I \pm \frac{1}{2}$ except for the case $I = 0$ when $j = \frac{1}{2}$), $\Gamma_{\lambda J}$ is the total 
width of the resonance, and $\langle D_J\rangle$ is the average spacing between resonances 
of spin $J$ and given parity at the excitation energy at which the capture is 
occurring. The contribution of inelastic scattering has been omitted from 
equation (4) on the grounds that we are interested in neutron capture processes 
after the neutron energies have been degraded by inelastic scattering 
to the point where the capture cross sections are comparable to or larger than 
the inelastic scattering cross sections. A contribution from fission has also 
been omitted since it seems unlikely that nuclei heavier than U** have sig- 
nificant fission cross sections for 100-kev neutrons. 

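One commits only a relatively small error (Lane and Lynn 1957) if in 
equation (3) we make the replacement: 


(5)    $\left\langle \sum_{j} \dfrac{\Gamma_{\lambda J(nlj)}\,\Gamma_{\lambda J(\gamma)}}{\Gamma_{\lambda J}} \right\rangle_{\lambda} \rightarrow \sum_{j} \dfrac{\langle\Gamma_{\lambda J(nlj)}\rangle\,\langle\Gamma_{\lambda J(\gamma)}\rangle}{\langle\Gamma_{\lambda J}\rangle}.$ 


In the vicinity of 100 kev in uranium nuclei, large contributions to the 
neutron capture come from s-, p-, and d-wave neutrons; the contribution 
from f- and higher-wave neutrons is negligible. This results from the fact 
that for s-, p-, and d-wave neutrons, 


(6)    $\langle\Gamma_{\lambda J(nlj)}\rangle \gg \langle\Gamma_{\lambda J(\gamma)}\rangle,$ 


whereas for f-wave neutrons the smaller barrier penetrability reverses the 
inequality of relation (6). It follows that 


(7)    $\dfrac{\langle\Gamma_{\lambda J(nlj)}\rangle\,\langle\Gamma_{\lambda J(\gamma)}\rangle}{\langle\Gamma_{\lambda J}\rangle} \approx \langle\Gamma_{\lambda J(\gamma)}\rangle$ 


for the lower neutron partial waves. 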
It has been found (Cameron 1957a, 1957d) that radiation widths are independent 
of $J$ and of $\lambda$ to a good approximation. Hence it follows from 
relations (2), (3), (4), (5), and (7) that 


(8)    $\langle\sigma\rangle = \text{const.}\;\dfrac{\langle\Gamma_\gamma\rangle}{\langle D_0\rangle},$ 


where for simplicity the radiation width has been written as $\langle\Gamma_\gamma\rangle$, and $\langle D_0\rangle$ is 
the average level spacing for the levels with $J = 0$. This ignores the fluctuation 
in the average of the statistical factor $(2J+1)^{2}$ which contributes to the 
odd-even variation in the capture cross sections ($\langle D_J\rangle \propto (2J+1)^{-1}$ according 
to Cameron (1958a)). 

Therefore the results from the Mike test imply that $\langle\Gamma_\gamma\rangle/\langle D_0\rangle$ is approximately 
constant in the vicinity of 100 kev above the neutron separation 
energies for nuclei between U²³⁹ and U²⁵⁴. In the following sections we will 
discuss some of the physical quantities associated with this ratio. 


NUCLEAR LEVEL SPACINGS 

It was shown by Newton (1956) that nuclear level spacings could be 
accurately calculated from the usual Fermi gas model if one used appropriate 
averages of the densities of actual nucleon orbits near the Fermi level. This 
procedure allows shell effects to be taken into account. The writer (Cameron 
1958a) has used this formulation but has improved the values of the nucleon 
orbit spacings by a procedure which relates them to the masses of nuclei. The 
level spacing formula is 


Se - (Zc 3) | .. 
¢ = . , 7zZUNU - a =~ "7. oe 
(9) (D,) QI+) GzGyG".1 3 Gt > t' | exp 3 Gt, 
where 

(10)    $G = G_Z + G_N,$ 

and 


3 E f ee" 
rGl2t\at 3 ; 

Here $G_Z$ and $G_N$ are the average spacings of the proton and neutron orbits 
at the Fermi level of the nucleon gas, $U$ is the effective excitation energy in 
Mev (defined as the actual excitation energy minus the pairing energy of the 
paired nucleons), $t$ is the nuclear temperature, and $\langle D_J\rangle$ is in electron volts. 


(11) t= 


NUCLEAR RADIATION WIDTHS 
In this section we consider at some length nuclear radiation widths. Since 
these play an important part in the further interpretation of the Mike yields, 
it is necessary to establish some confidence in the validity of the basic equation 
(12) below. 
Nuclear radiation widths are slowly varying quantities. They have a 
general decreasing trend with increasing mass number, with superposed increases 
just below closed neutron shells. The measured values collected by 
Levin and Hughes (1956) and by Stolovy and Harvey (1957) are plotted in 
Fig. 2. 

FIG. 2. Total radiation widths plotted from the compilations of Stolovy and Harvey 
(1957) and Levin and Hughes (1956). 


The radiation process is rather complex, since there is usually a very large 
number of levels which are energetically accessible for radiative transitions 
from the capturing state. It has been shown that electric dipole transitions 
predominate in the capture process (Bartholomew, Campion, Knowles, and 
Manning 1957). In such transitions the angular momentum spin vector must 
change by one unit and the parity must change. Hence in general any given 
spin state radiates to states of opposite parity and three values of spin. In 
calculating total radiation widths it is necessary to sum over all these transi- 
tions. 

Blatt and Weisskopf (1952) have argued that in general the matrix elements 
for the transitions to lower states will be reduced as the level spacing in the 
vicinity of the radiating state decreases. In the radiation process all but one 
of the nucleons form a core which remains unchanged; coupled to this core 
is the remaining nucleon which changes its angular momentum. The total 
radiation probability for the transition between two given states is composed 
of all the ways in which A-1 nucleons can form a common parent in the initial 
and final states while the last nucleon changes its state. The probabilities of 
admixture of these single particle wave functions into the initial and final 
states decrease as the number of states into which admixture can occur increases. 
Therefore, the total radiation width may be expected to be proportional 
to the spacing of levels $\langle D_J\rangle$ near the radiation state multiplied by the 
sum of all the individual transition probabilities. 

Summations of this kind have been carried out by the writer (Cameron 
1957a, 1957d), who found that total radiation widths (in millielectron volts) 
could be approximately fitted by 


(12) Pr, = A24(D(U)) J) EdE/(Di«((U-E)). 

In evaluating this formula level spacings calculated from equation (9) were 
used, and the integration was performed numerically for all nuclei whose 
radiation widths are plotted in Fig. 2. 

The ratios of the observed radiation widths to those calculated from equation 
(12) are plotted in Fig. 3. It may be seen that the shell structure effects evident 
in Fig. 2 have disappeared in Fig. 3. However, the points in Fig. 3 do appear 
to define a structure, the trend of which has been drawn in the figure. The 
following interpretation may be suggested for this structure. 

The admixture of single particle wave functions into the actual wave func- 
tions at the region of excitation corresponding to the neutron binding energy 
is largest in the vicinity of the appropriate maxima of the neutron strength 
functions. According to simple optical models of the nucleus the even partial 
neutron waves have strength function maxima in the same regions of mass 
number, and the odd partial waves have strength functions at intermediate 
regions of mass number. If a nucleus absorbs an s-wave neutron, the wave 
function of the compound state may be thought of as a combination of a 
large subset of the excited states of the target nucleus which combine with 
the s-wave neutron to preserve the total angular momentum of the state. 
Certain other excited states of the target nucleus couple d-, g-,... wave 
neutrons in such a way as also to preserve the total angular momentum and 
parity of the compound state. Still further representations of the compound 
state may be obtained by coupling the excited states of that parent nucleus, 
which has one more neutron and one less proton than the target nucleus, to 
s-, d-, g-,... wave protons. In general the fractional admixtures of these 
wave functions into that of the compound state probably diminish as the 
amount of recoupling of the nucleons increases, since extensive recouplings 
usually represent highly excited states. 

The electric dipole transition matrix elements are proportional to the 
amount of overlap of these parent cores between the initial and final states. 
The overlap integrals will be larger when the admixtures of single particle 
wave functions into the initial states are larger, such as will occur in the 
regions of the s-wave neutron strength function peaks (A ~ 55 and 160), the 
d-wave neutron peaks (A ~ 50 and 200), and the s- and d-wave proton 
peaks (A ~ 75 and 50 respectively). It may be seen in Fig. 3 that the curve 
is generally in accord with these expectations. 

It is also necessary to consider the strengthening of the transition matrix 
elements owing to increased admixtures of single particle wave functions 
into the final states. Most of the transitions go to states with roughly half 
the neutron binding energy (more than half of the neutron binding energy 
in the case of very heavy nuclei). Since the single nucleon changes its orbital 
angular momentum in the radiative transition, s-wave neutron capture should 
be enhanced when the middle range of excited states has large admixtures 
of p-, f-, h-,... wave nucleons. Again the lower partial waves should be more 
important than the higher ones owing to the fact that more extensive
recouplings correspond in general to higher excitation energies. This
enhancement occurs in between the strength function peaks for the odd partial wave
nucleons, in the same general region as the strength function peaks for the 
even partial wave nucleons. 

These considerations lead to the suggestion that the radiation widths 
associated with p-wave neutron capture may be somewhat different from 
those associated with s-wave capture. The broad structure peaks should 
occur in different regions of mass number. There is as yet insufficient experi- 
mental data to test this prediction. The meager information available about 
radiation widths associated with p-wave capture is also plotted in Fig. 3. 
Only one of these points is a direct experimental measurement (Saplakoglu, 
Bollinger, and Cote 1958); the others have been deduced by the writer from 
the capture cross section curves measured in the kev region at Duke University 
(Bilpuch and Newson 1958). These latter points are subject to considerable 
uncertainty. 

Some of the increased radiation probability shown in Fig. 3 near A = 160 
and A = 240 may also possibly be due to the fact that nuclei in these regions 
are deformed. It has been found (Fuller, Petree, and Weiss 1958) that the 
giant dipole resonance is broadened in deformed nuclei. The matrix elements 
under discussion in this paper might receive some strengthening from collective 
effects in the wings of the giant resonance. 

FIG. 3. Ratios of observed radiation widths to those calculated from equation (12). The 
solid line is intended as a guide to the eye to denote the trend of the points with mass number. 

It therefore appears that equation (12) gives a fairly good account of the 
total radiation widths. Deviations from this relation are relatively small, are 
regular in nature, and are probably susceptible to theoretical interpretation. 
It would be very desirable to check also to see whether the integrand in 
equation (12) describes the shape of the primary photon spectrum correctly. 
Unfortunately any experiment has to measure the total photon spectrum 
which includes a large number of secondary photons. The writer has made 
cascade calculations designed to convert the primary spectrum of the integrand 
of equation (12) into the total spectrum which would be measured. These 
have been compared with measured spectrum shapes (Bartholomew, Campion, 
Knowles, and Manning 1957; Cameron and Bartholomew, unpublished). 
Fairly good agreement has been observed between the calculated and measured 
spectra, particularly in the heavier nuclei where these statistical methods are 
more reliable. These comparisons therefore give additional confidence that 
equation (12) provides a fairly good description of radiation widths. 

From this discussion it will be apparent that the results of the Mike test 
imply that the following relation holds among the heavy uranium nuclei: 


(13)    $\langle\Gamma_\gamma\rangle / \langle D_0\rangle = \mathrm{const.} \int_0^U E^3\,dE \,/\, \langle D(U-E)\rangle = \mathrm{constant}$.


EXCITATION ENERGIES 

The quantity $I = \int_0^U E^3\,dE \,/\, \langle D(U-E)\rangle$ of equation (13) depends very 
strongly on the effective excitation energy U. Most empirical and semiempirical 
mass formulas (see for example Green 1954; Levy 1957; Cameron 1957b) 
predict that U = 2.5 to 3.0 Mev in U* as compared with 4.7 Mev in U**. 
The major contribution to the integrand of I occurs at an energy somewhat 
less than U. If these excitation energies are correct, then the values of I 
for U** and U** will be about the same only if the level spacing near U = 2 
Mev in U** is approximately the same as that near U = 4 Mev in U**. As 
may be seen from equation (9) this in turn requires that the density of nucleon 
orbits near the Fermi level in U** should be approximately double the similar 
density in U9, 
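
The sensitivity of I to the excitation energy can be illustrated numerically. The following is only a sketch: equation (9) for the level spacing is not reproduced here, so the exponential form `toy_spacing` and its parameter `a` are hypothetical stand-ins, and the function names are introduced purely for this example.

```python
import math

def radiation_integral(u, spacing, steps=20000):
    """Midpoint-rule estimate of I = integral_0^U E^3 dE / D(U - E).

    `spacing` is any callable returning a level spacing D at a given
    excitation energy; here it stands in for equation (9).
    """
    h = u / steps
    total = 0.0
    for i in range(steps):
        e = (i + 0.5) * h                  # photon energy at the step midpoint
        total += e ** 3 / spacing(u - e)
    return total * h

def toy_spacing(x, a=20.0):
    """Hypothetical level spacing falling off as exp(-2*sqrt(a*x))."""
    return math.exp(-2.0 * math.sqrt(a * x))

# Illustrative comparison at two excitation energies (Mev).
ratio = radiation_integral(4.7, toy_spacing) / radiation_integral(2.75, toy_spacing)
```

With any spacing that shrinks rapidly with excitation energy, `ratio` comes out far greater than unity, which is the strong dependence on U referred to above.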

The greater part of the density of nucleon orbits in very heavy nuclei is 
contributed by the density of neutron orbits. It may be seen from an inspec- 
tion of the numbers in the appendix of the writer’s paper on level spacings 
(Cameron 1958a) that there does not seem to be a significant decrease in 
neutron orbit spacings for nuclei along the valley of beta stability beyond 
U5. Hence it is unlikely that any such effect is associated with shell structure. 

In the Fermi gas model of the nucleus a doubling of nucleon orbit densities 
would occur if the nuclear volume were doubled. It is perhaps worth inquiring 
as to whether any such effect could be associated with the smaller neutron 
binding energies in U***. We would require a radius expansion of 26%. About 
2% is given by the increase in nuclear volume in going from U** to U*®. 
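
The 26% figure follows from simple arithmetic, sketched below under the Fermi gas assumption stated above that the orbit density scales with the nuclear volume, that is, with the cube of the radius. The function name is introduced only for this illustration.

```python
def radius_factor(orbit_density_factor):
    """Radius scale factor needed for a given factor in orbit density,
    assuming the density is proportional to the volume (R cubed)."""
    return orbit_density_factor ** (1.0 / 3.0)

# Doubling the orbit density requires the radius to grow by about 26%.
expansion_percent = (radius_factor(2.0) - 1.0) * 100.0
```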

A reasonable shape for the nuclear potential is (Wilets 1958): 


(14)    $V = V_0 / \{1 + \exp[(r-R)/a]\}$,


where $V_0$ = -40 Mev, a = 0.65 fermis (i.e. units of $10^{-13}$ cm), R is the 
radius of the half-central-density point, and r is the radial distance corre- 
sponding to V. This potential lies outside the nuclear matter, but if the 
binding energy of the last neutron decreases, then the nuclear matter will 
extend farther out within this nuclear potential. On this account we may 
expect the nuclear radius associated with the Fermi level to increase by 5 or 
6% in going from U** to U9. 
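
One way to see how a smaller binding energy lets the nuclear matter extend farther out is to locate the radius at which the potential of equation (14) equals minus the binding energy. This classical turning-point criterion is an assumption of the sketch, not necessarily the criterion used in the text, and the two energies are illustrative values only.

```python
import math

V0 = -40.0      # Mev, central depth quoted in the text
A_DIFF = 0.65   # fermis, surface diffuseness quoted in the text

def potential(r, big_r):
    """Equation (14): V(r) for half-depth radius big_r (both in fermis)."""
    return V0 / (1.0 + math.exp((r - big_r) / A_DIFF))

def turning_point_offset(binding_mev):
    """Distance r - R (fermis) at which V(r) = -binding_mev."""
    return A_DIFF * math.log(-V0 / binding_mev - 1.0)

# Outward shift of the turning point when the binding energy drops
# from 4.7 Mev to a hypothetical 2.75 Mev.
shift = turning_point_offset(2.75) - turning_point_offset(4.7)
```

Under these assumptions the shift is a few tenths of a fermi, which is a few per cent of a heavy-nucleus radius.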

Clearly these increases of nuclear radius are inadequate to provide a doubling 
of nucleon orbit densities in U*® relative to U?. Hence it seems necessary to 
conclude that the mass formulas are incorrect, and that the neutron binding 
energy in U*® is nearly as large as that in U**. 

In this connection it is perhaps useful to call attention to a possible correc- 
tion to the mass formula which has been discussed by Szamosi and Ziegler 
(1956) and by Green (1958). This is the direct symmetry term. Szamosi and 
Ziegler showed that if the neutron radius $R_n$ and the proton radius $R_p$ are 
different, then there should be a mass contribution of the form 


(15)    $\Delta M = \mathrm{const.}\,[(R_n^2 - R_p^2)/R_p R_n]\,(N - Z)$,


where N and Z are the numbers of neutrons and protons in the nucleus. 

Nuclei along the valley of beta stability have approximately equal neutron 
and proton binding energies, and it appears that for such nuclei $R_n = R_p$ 
(Wilets 1958). Hence the term of equation (15) would make a negligible con- 
tribution to nuclei near the valley of beta stability to which the usual mass 
formulas are fitted. However, in the neutron rich region the argument given 
above suggests that the effective neutron radius may be significantly larger 
than the effective proton radius. In this case the term in equation (15) may 
cause a significant decrease in the masses in the neutron rich region, thus 
causing neutron binding energies to decrease more slowly than predicted by 
the usual mass formulas as neutrons are added to a heavy nucleus. 

Another possibility which the writer finds more appealing would be the 
introduction of a Coulomb surface correction term. It has been indicated 
above that the density distribution of neutrons near the nuclear surface 
becomes more diffuse as the neutron binding energies decrease. Associated 
with this additional diffuseness there must be a slight enlargement of the 
nuclear potential energy distribution. It is probable that this induces an 
additional, smaller but significant, diffuseness in the surface density distribu- 
tion of the protons. The Coulomb energy of a heavy uranium nucleus is of the 
order of 1000 Mev. If the writer’s trapezoidal charge model of the nucleus 
(Cameron 1957b) is modified by increasing the surface thickness by only 
10-“ cm, while keeping the central density constant, the Coulomb energy of 
a heavy uranium nucleus would decrease by 7 Mev. This would be quite 
adequate to stabilize the neutron binding energies in the manner required 
by the conclusions of this paper. 

From this discussion it appears that measurements of yields resulting from 
multiple neutron capture can give significant new useful information about 
nuclear structure in the unexplored realm of nuclei on the neutron rich side 
of the valley of beta stability. This information is of great interest in itself, 
and is needed for a detailed understanding of the process of neutron capture 
on a fast time scale which is necessary for nucleogenesis and which may 
occur in type I supernovas (Cameron 1957c; Burbidge, Burbidge, Fowler, 
and Hoyle 1957; Cameron 1958b). In this connection it is hoped that more 
information will be released on the measurements of the yields of the heavy 
rare earth activities resulting from multiple neutron capture in the fission 
products made during the Mike explosion, and also on any other measure- 
ments which relate to multiple neutron capture in the structural material of 
the fusion device. 
It would also be very useful to conduct further experiments on multiple 
neutron capture using fusion devices with a deuterium-tritium mixture. 


The writer is indebted to G. A. Bartholomew, H. A. Bethe, J. E. Lynn, 
and A. M. Lane for useful discussions of nuclear radiation widths, to A. E. S. 
Green for discussions about nuclear masses, and to L. G. Elliott for valuable 
conversations concerning the interpretation of the Mike test yields. 


THE TRANSPORT OF HEAT BETWEEN DISSIMILAR SOLIDS 
AT LOW TEMPERATURES 


W. A. LITTLE 


ABSTRACT 


The resistance offered to the flow of heat by the mismatch of the elastic 
constants at the interface between two materials has been calculated. It is 
shown that for a perfectly joined interface the heat flow is proportional to the 
difference of the fourth powers of the temperature on each side of the interface. 
Deviations from this temperature dependence are to be expected for rough 
surfaces and for surfaces pressed into contact with one another. The calculated 
contact resistance between some common solids is given, and graphs are pre- 
sented from which the heat flow between any two materials may be computed. 
It is shown too that the spin-phonon, phonon-electron, and phonon-phonon 
relaxation processes give rise to additional resistive processes in some solids, 
some of which restrict the heat flow at the surface. The theoretical results 
compare well with the available experimental data. However, the problem of 
the contact resistance between helium and metals is still unresolved. A tentative 
explanation of this is presented which predicts that there should be an appreciable 
difference between the contact resistance of a metal in the normal and in the 
superconducting state. 


INTRODUCTION 


In experiments conducted at very low temperatures it has become well 
established that a finite difference of temperature will occur at an interface 
between two dissimilar materials when a heat flux is maintained across this 
interface. Kapitza (1941) first observed such an effect between helium II 
and a metal, and since then many investigators have reported similar results 
with helium II (Fairbank and Wilks 1955) and helium 3 (Fairbank and Lee 
1957) and between paramagnetic salts and metallic fins (Goodman 1953). 
Apart from several attempts to explain the contact resistance in the specific 
case of a helium—metal interface (Kronig and Thellung 1950; Gorter, Taconis, 
and Beenakker 1951; and Khalatnikov 1952), no general explanation of the 
effect has been offered. In view of the fact that this resistance plays a very 
important role in all experiments using cooling by contact at temperatures 
below 1° K, a general explanation is long overdue. In this paper such an 
attempt has been made, firstly, by considering the ideal case of two perfectly 
joined media and, secondly, by extending this result to several cases of practical 
importance. 


I. PERFECT JUNCTION 

Let us consider two semi-infinite isotropic solids joined in the plane yz at 
x = 0. In order to simplify the problem let us suppose that the modulus 
of rigidity of both materials is zero so that we neglect the transverse phonons 
in both solids. This greatly simplifies the boundary conditions at the interface 
and yet allows one to see the trend of the solution in the more general case 
considered in II. 

The number of phonons of energy $h\nu$ incident upon a small area dA of the 
interface per unit time at an angle of incidence between $\theta_1$ and $\theta_1 + d\theta_1$ will 
be: 


(1)    $\tfrac{1}{2}\,c_1 N_1(\nu) \cos\theta_1 \sin\theta_1\,d\theta_1\,dA\,d\nu$,


where $c_1$ is the group velocity of the longitudinal phonons in the first medium 
and $N_1(\nu)\,d\nu$ is the number of phonons of energy lying between $h\nu$ and 
$h(\nu + d\nu)$ per unit volume of the first medium. 

Some of these phonons will be reflected at the interface and some will be 
refracted into the second medium at an angle of refraction between $\theta_2$ and 
$\theta_2 + d\theta_2$. The two angles $\theta_1$ and $\theta_2$ are related by Snell's law of acoustic refraction 
from which one may obtain the relation: 


(2)    $d\theta_2 = \frac{c_2 \cos\theta_1}{c_1 \cos\theta_2}\,d\theta_1$,


where $c_2$ is the group velocity of longitudinal phonons in the second medium. 
The net flow of heat across the interface from medium 1 to medium 2 due to 
these phonons leaving or entering at an angle $\theta_1$ is then: 


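(3)    $\frac{dQ}{dt} = \tfrac{1}{2}\iiint N_1(\nu)\,h\nu\,c_1\,\alpha_1(\theta_1)\cos\theta_1\sin\theta_1\,d\theta_1\,d\nu\,dA$

$\qquad - \tfrac{1}{2}\iiint N_2(\nu)\,h\nu\,c_2\,\alpha_2(\theta_2)\cos\theta_2\sin\theta_2\,d\theta_2\,d\nu\,dA$.


In this expression $\alpha_1(\theta_1)$ is the transmission coefficient of the interface for 
phonons incident at an angle $\theta_1$ in medium 1. Because the longitudinal phonons 
are the quantized longitudinal acoustic waves, this coefficient will be given by 
the equivalent expression for acoustic waves (Rayleigh 1894): 


(4)    $\alpha_1(\theta_1) = \alpha_2(\theta_2) = \frac{4\,\dfrac{\rho_2 c_2}{\rho_1 c_1}\,\dfrac{\cos\theta_2}{\cos\theta_1}}{\left(\dfrac{\rho_2 c_2}{\rho_1 c_1} + \dfrac{\cos\theta_2}{\cos\theta_1}\right)^2}$,


where $\rho_1$ and $\rho_2$ are the densities of the two media. 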

The value of $N(\nu)$ is given by the number of degrees of freedom of the 
lattice lying between $\nu$ and $\nu + d\nu$, and the Bose-Einstein energy distribution. 


(5)    $N(\nu)\,d\nu = \frac{4\pi}{c^3}\,\frac{\nu^2\,d\nu}{e^{h\nu/kT} - 1}$.


Inserting (5) in (3) and integrating with respect to A one obtains the
expression: 


(6)    $\frac{dQ}{dt} = \frac{2\pi k^4 \Gamma A}{h^3 c_1^2}\,[T_1^4 f(T_1) - T_2^4 f(T_2)]$ ergs/second

where 


(7)    $f(T) = \int_0^{h\nu_m/kT} \frac{z^3\,dz}{e^z - 1}$,

$\qquad z = h\nu/kT$,

$\qquad \Gamma = \int_0^{\pi/2} \alpha_1(\theta_1)\sin\theta_1\cos\theta_1\,d\theta_1$.


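At low temperatures one may make the usual approximation ($h\nu_m/kT = \infty$) 
in the integral over z as it occurs in the Debye theory of specific heats, and 
thus obtain the relation: 


(8)    $\frac{dQ}{dt} = \frac{2\pi k^4 \Gamma A}{h^3 c_1^2}\left(\frac{\pi^4}{15}\right)(T_1^4 - T_2^4) = 5.01 \times 10^{16}\,\frac{\Gamma A}{c_1^2}\,(T_1^4 - T_2^4)$ ergs/second


with $c_1$ in cm/sec and T in ° K. 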
For small values of the ratio $(T_1 - T_2)/T_1$ we may write this in the
convenient form: 


(9)    $\frac{dQ}{dt} = 2.0 \times 10^{17}\,\frac{\Gamma A\,T^3\,\Delta T}{c_1^2}$ ergs/second


where $\Delta T = T_1 - T_2$. 

The integral $\Gamma$ is a complicated function of the densities and of the acoustic 
velocities of the two media. In its explicit form it is difficult to use, so it has 
been computed for a range of values of the ratio of the densities and the ratio 
of the acoustic velocities. The integral is given in graphical form in Fig. 1. 
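
The numerical coefficients of equations (8) and (9) can be checked directly from the physical constants. The sketch below uses present-day c.g.s. values of k and h, which is an assumption of this example; with them the prefactor comes out near 5.1 × 10^16, about 2% above the 5.01 × 10^16 quoted in the text. The function names are introduced only for illustration.

```python
import math

K_B = 1.380649e-16     # Boltzmann constant, erg/K (modern value)
H = 6.62607015e-27     # Planck constant, erg s (modern value)

def prefactor():
    """2*pi*k^4/h^3 times pi^4/15, the coefficient of equation (8)."""
    return 2.0 * math.pi ** 5 * K_B ** 4 / (15.0 * H ** 3)

def heat_flow(gamma, area_cm2, c1_cm_s, t1, t2):
    """Equation (8): heat flow in ergs/second."""
    return prefactor() * gamma * area_cm2 / c1_cm_s ** 2 * (t1 ** 4 - t2 ** 4)

def heat_flow_linear(gamma, area_cm2, c1_cm_s, t_mean, delta_t):
    """Equation (9): small temperature difference form, 4*T^3*dT."""
    return 4.0 * prefactor() * gamma * area_cm2 / c1_cm_s ** 2 * t_mean ** 3 * delta_t

exact = heat_flow(0.5, 1.0, 4.0e5, 1.01, 0.99)
approx = heat_flow_linear(0.5, 1.0, 4.0e5, 1.0, 0.02)
```

For a 2% temperature difference the linearized form agrees with the full expression to about one part in ten thousand.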

This simple acoustic model shows that one should expect a discontinuous 
change of temperature across the interface and that for small heat fluxes the 
drop in temperature $\Delta T$ should be proportional to the heat flux and inversely
proportional to the third power of the mean absolute temperature. From the variation of $\Gamma$ as shown 
in Fig. 1, with respect to the relative densities and acoustic velocities in the 
two media, it can be seen that to obtain the maximum transfer of heat from 
one medium to another the densities should be similar and the acoustic 
velocities similar. The strong dependence of $\Gamma$ on the relative acoustic 
velocities for $c_2 > c_1$ is due to the total reflection of phonons for $\theta_1$ greater 
than the critical angle $\sin^{-1}(c_1/c_2)$. It should be noted that the matching of 
the specific acoustic impedance of each medium (i.e., $\rho_1 c_1 = \rho_2 c_2$) is not the 
condition for the maximum transfer of heat when $c_2 > c_1$. 
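
The behaviour of the integral for the simple acoustic model can be reproduced with a short numerical sketch based on equations (4) and (7) and Snell's law. The midpoint integration and the function names are choices made for this example and are not the method used for Fig. 1.

```python
import math

def transmission(theta1, rho1, c1, rho2, c2):
    """Equation (4) for a wave incident at theta1 (radians) in medium 1."""
    s2 = (c2 / c1) * math.sin(theta1)     # Snell: sin(theta2)/c2 = sin(theta1)/c1
    if s2 >= 1.0:
        return 0.0                        # beyond the critical angle: total reflection
    z = (rho2 * c2) / (rho1 * c1)         # ratio of specific acoustic impedances
    r = math.sqrt(1.0 - s2 * s2) / math.cos(theta1)   # cos(theta2)/cos(theta1)
    return 4.0 * z * r / (z + r) ** 2

def gamma(rho1, c1, rho2, c2, steps=20000):
    """Integral of alpha*sin*cos over 0..pi/2 by the midpoint rule."""
    h = (math.pi / 2) / steps
    total = 0.0
    for i in range(steps):
        t = (i + 0.5) * h
        total += transmission(t, rho1, c1, rho2, c2) * math.sin(t) * math.cos(t)
    return total * h

g_matched = gamma(1.0, 1.0, 1.0, 1.0)     # identical media
g_12 = gamma(1.0, 1.0, 2.0, 3.0)          # into a faster, denser medium
g_21 = gamma(2.0, 3.0, 1.0, 1.0)          # the reverse direction
```

In this sketch identical media give one half, the faster second medium gives a much smaller value because of total reflection, and the two directions satisfy `g_12 / c1**2 == g_21 / c2**2` to within the integration error, so that no net heat flows at equal temperatures.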

FIG. 1. Contours of $\Gamma$ for transmission of heat from medium 1 to medium 2 with respect 
to the ratio of the acoustic velocities $c_2/c_1$ and the densities $\rho_2/\rho_1$ of the two media. Poisson's 
ratio is equal to 0.5 in both media. 

The maximum value of $\Gamma$ is 0.5, which corresponds to a perfect match of 
the densities and the acoustic velocities. In this case (6) predicts that a 
temperature difference will occur across the boundary. This apparent paradox 
is explained by the fact that it was tacitly assumed in (6) that the two media 
were each at a uniform temperature in the presence of the heat flux. This 
implies that the region in which the temperature is measured should extend 
for a distance no greater than the mean free path of the phonons on each side 
of the barrier. In the particular case of identical media sustaining a heat flux 
the theory of thermal conductivities requires that there should be a tempera- 
ture drop over a distance of the mean free path. 

That this is so can be shown as follows. If $c_1 = c_2$ and $\rho_1 = \rho_2$ then $\alpha_1(\theta) = 1$ 
for all values of $\theta$. Let T be the temperature at the barrier and let the mean 
free path of phonons in this medium be $\lambda$; then the temperature $T_1$ of a point 
where a phonon is last scattered before striking the barrier will be 
$[T + (\partial T/\partial x)\lambda\cos\theta]$ and likewise $T_2$ will be $[T - (\partial T/\partial x)\lambda\cos\theta]$. At low
temperatures and for small values of $T_1 - T_2$ we obtain 


$\frac{dQ}{dt} = \frac{2\pi k^4 A}{h^3 c_1^2}\left(\frac{\pi^4}{15}\right)(4T^3)\,\frac{\partial T}{\partial x}\,2\lambda\int_0^{\pi/2}\cos^2\theta\,\sin\theta\,d\theta$

$\qquad = \tfrac{1}{3}\,C_v\,\lambda\,c_1\,\frac{\partial T}{\partial x}\,A$,


where $C_v$ = lattice specific heat per unit volume, which gives a thermal
conductivity of $\tfrac{1}{3} C_v \lambda c_1$, which is the usual expression for the conductivity at low 
temperatures. 
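
The equality of the two forms above can be verified numerically. The sketch assumes the low-temperature Debye specific heat of a single longitudinal branch, uses present-day c.g.s. constants, and introduces function names and sample values purely for illustration.

```python
import math

K_B = 1.380649e-16     # erg/K
H = 6.62607015e-27     # erg s

def debye_cv(t, c):
    """Low-temperature specific heat per unit volume (erg/cm^3 K) of one
    longitudinal phonon branch with velocity c (cm/sec)."""
    return 16.0 * math.pi ** 5 * K_B ** 4 * t ** 3 / (15.0 * H ** 3 * c ** 3)

def flux_acoustic(t, grad, lam, c, steps=2000):
    """Heat flux per unit area from the acoustic expression with alpha = 1."""
    h = (math.pi / 2) / steps
    angular = sum(math.cos((i + 0.5) * h) ** 2 * math.sin((i + 0.5) * h)
                  for i in range(steps)) * h           # tends to 1/3
    pref = 2.0 * math.pi ** 5 * K_B ** 4 / (15.0 * H ** 3 * c ** 2)
    return pref * 4.0 * t ** 3 * grad * 2.0 * lam * angular

def flux_kinetic(t, grad, lam, c):
    """Heat flux per unit area from the conductivity (1/3) C_v lambda c."""
    return debye_cv(t, c) * lam * c * grad / 3.0

# Hypothetical sample values: 1 deg K, 0.01 deg/cm, lambda = 1e-3 cm, c = 4e5 cm/sec.
a = flux_acoustic(1.0, 0.01, 1.0e-3, 4.0e5)
b = flux_kinetic(1.0, 0.01, 1.0e-3, 4.0e5)
```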

The conduction of heat over these small distances at very low temperature 
is, of course, analogous to that in gases at low pressures. In the case of a gas 
the heat flow for distance less than the mean free path is limited by the 
number of molecules, that is, the pressure, while in a solid the heat flow is 
limited by the number of phonons, which depends upon the temperature. 
Hence the contact resistance is related to the fact that for distances less than 
the mean free path the heat flux is not proportional to the temperature 
gradient but requires a finite temperature difference. The effect of the interface 
is to reduce this heat flux for a given temperature difference in proportion to 
the transmission coefficient of the barrier. 

It is possible to extend this acoustic model from the ideal case considered 
above to obtain a qualitative understanding of the effect of roughness and of 
imperfect contact at the interface upon the temperature dependence and 
magnitude of the contact resistance. This may be seen by first considering 
what happens if the effective area of the interface depends upon the frequency 
of the incident lattice waves. Suppose that the effective area of the interface 
$A'(\nu)$ for phonons of frequency $\nu$ may be written as 


(10)    $A'(\nu) = A_0\,\nu^n$,


where n may be 0 or a positive or negative quantity. We will assume that 
such an expression is possible over the range of frequencies of phonons which 
contribute appreciably to the heat content of the lattice. Using (10), equation 
(6) now yields 

(11)    $\frac{dQ}{dt} = B\,(T_1^{\,n+4} - T_2^{\,n+4})$,

where B contains terms independent of the temperature. 
The heat flux across the interface now will be more or less strongly dependent 
upon the absolute temperature depending on n being positive or negative. 
Let us now consider two cases of practical interest, firstly, an interface 
which is rough but perfectly joined at all points and, secondly, an interface 
at which contact between the two media occurs at some points but not at 
others. 
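
The temperature dependence in equation (11) can be checked numerically: with an effective area proportional to the n-th power of the frequency, the one-sided flux should rise by a factor of 2 to the power n+4 when the temperature is doubled. The sketch works in units with h = k = 1, and its function names are introduced only for this illustration.

```python
import math

def flux_integral(t, n_exp, steps=40000):
    """Integral of nu**(3+n) / (exp(nu/t) - 1) d(nu), with h = k = 1.

    The factor nu**n represents the frequency-dependent effective area of
    equation (10); the cutoff 60*t is where the integrand is negligible.
    """
    nu_max = 60.0 * t
    h = nu_max / steps
    total = 0.0
    for i in range(steps):
        nu = (i + 0.5) * h
        total += nu ** (3 + n_exp) / math.expm1(nu / t)
    return total * h

def doubling_ratio(n_exp):
    """Factor by which the one-sided flux grows when T is doubled."""
    return flux_integral(2.0, n_exp) / flux_integral(1.0, n_exp)
```

For example, `doubling_ratio(0)` recovers the factor 16 of the fourth-power law, while n = 1 and n = -1 give 32 and 8 respectively.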


The Rough Interface 
Let Fig. 2 represent the interface between the two rough media. Let $\Delta$ be 
defined as the mean amplitude of the roughness. If the mean free path of the 
phonons is comparable to $\Delta$ then the effective area of the interface will differ 
for phonons of different frequency. The roughness of the interface will occur 
over a small fraction of a wavelength of a long lattice wave and, consequently, 
the wave will be transmitted as if the interface was smooth. For these waves 
the effective area will be the macroscopic area of a plane at the interface. 
On the other hand the short lattice waves will be transmitted across each 
individual hill and dale at the interface and the effective area will be the 
integrated microscopic area of the interface. We should expect, then, the 
effective area to increase with increasing phonon energy. 

FIG. 2. Diagrammatic representation of a rough interface illustrating the larger effective 
area of the interface to short lattice waves compared with that for long waves. 

$A'(\nu)$ now will be given by (10) with a positive value of n. Also, the transport 
of heat across the interface will be greater than that given by equation (8) 
due to the average effective area being greater than the macroscopic area of 
the interface. 

If, however, the mean free path is much greater than $\Delta$ the number of 
phonons of given energy incident upon the interface will be determined by the 
cross-sectional area of the solid normal to the heat flux and not by the
microscopic area of the interface. In this case the effective area of the interface 
will be the same for all phonons and equation (8) will apply with A given by 
the macroscopic cross-sectional area of the interface. 


Imperfect Contact 

Another case of some practical interest is that of the imperfect junction of 
two media. In many experiments of contact cooling at temperatures below 
1° K, the specimen is cooled via a metallic link which is pressed into con- 
tact with a powdered paramagnetic salt. Berman (1956) has shown that the 
amount of heat transported across the interface between two surfaces pressed 
into contact is proportional to the total force applied. This suggests that the 
contact occurs at a few points, and that as greater pressure is applied the 
surfaces deform and more points make contact, the number of point contacts 
then being proportional to the applied pressure with each carrying approxi- 
mately the same heat flux. In the case of a paramagnetic salt being pressed 
in powder form against a metallic fin the contact would occur at a number of 
points. It is instructive to apply our acoustic model to this case. 

We will suppose that the interface can be represented by Figs. 3(a) and 
3(b)—a series of point contacts separated by a mean distance, B. Phonons with 
wavelengths which are very large compared with B will move all the points 
lying adjacent to one another approximately in phase, and as a result the 
whole boundary between adjacent points will move in phase (see Fig. 3(a)). 
The mean displacement at points on the boundary not in contact with the 
first medium will be only slightly less than that at points of contact. We should 
expect, therefore, that the strain in the second medium a short distance from 
the interface will be only slightly less than it would be if the junction were 
perfect. Hence the energy transported across the interface, which is propor- 
tional to the square of the strain, will be only slightly less than that for the 
ideal case. On the other hand, for phonons with wavelengths which are small 
compared with B the motion of adjacent points of contact will differ by a 
random phase factor. This factor will be determined for each point by the 
spatial localization of the phonon and the actual value of the separation of 
the points of contact. In this case the motion of the barrier not in contact 
with the first medium will tend to zero due to the destructive interference of 
the surface waves from all the nearby points of contact (see Fig. 3(b)). The 
mean displacement in the second medium now will be much smaller and the 
transmitted energy correspondingly reduced. 


FIG. 3. Diagrammatic representation of the interface of two solids in imperfect contact. 

(a) The surface is shown, firstly, at rest, and, secondly, at the peak of a low frequency lattice 
wave. The wavelength in this case is much greater than the mean separation of the points of 
contact B. 

(b) The corresponding picture for a high frequency lattice wave. The dotted line represents 
the surface of the second medium at rest. 


From the above considerations one would expect the effective area of the 
interface to decrease with increasing phonon frequency, giving (10) with a 
negative n. Consequently, the transport of heat across the barrier would be 
less temperature-dependent and the total heat flux less than that given by 
(8) for the macroscopic area of contact. 


II. THE GENERAL CASE 

In Section I we have considered the simple case of the transport of heat 
between two media in which the moduli of rigidity are both zero. This is a 
somewhat artificial case so it is necessary, for practical reasons, to consider 
the more general case where both longitudinal and transverse phonons occur. 
We shall consider the case of a perfect junction. The reasoning for the rough 
and imperfect junction is equally applicable to this case so there is no need 
to reconsider these effects. 

Equation (8) may be readily generalized to add the contribution from the 
transverse phonons to the heat flow across the interface. We then obtain: 


(12)    $\frac{dQ}{dt} = 5.01 \times 10^{16}\,A\left(\frac{\Gamma_l}{c_l^2} + \frac{2\Gamma_t}{c_t^2}\right)(T_1^4 - T_2^4)$ ergs/sec,

where $\Gamma_l$ = transmission coefficient of longitudinal phonons, 
$\Gamma_t$ = transmission coefficient of transverse phonons, 
$c_l$ = velocity of longitudinal phonons, 
$c_t$ = velocity of transverse phonons. 

In this case the detailed evaluation of $\Gamma_l$ and $\Gamma_t$ is much more difficult. The 
reason for this is that at the interface each incident wave, whether longitudinal 
or transverse, in general breaks up into four waves, reflected transverse and 
longitudinal waves, and refracted transverse and longitudinal waves (Kolsky 
1953). Because of the different velocities of the transverse and longitudinal 
waves, two critical angles of reflection are found and in the interval between 
these two angles a phase shift occurs in the refracted and reflected waves 
whose magnitude depends upon the angle of incidence. In this case there is 
no simple expression corresponding to equation (4). It is easier to calculate 
the transmission coefficient for each angle of incidence by a numerical solution 
of the equations determined by the boundary conditions. This has been done 
and the values of $\Gamma_l$ and $\Gamma_t$ have been computed for a range of relative 
densities, Poisson’s ratio of the media, and the relative velocities of the 
longitudinal waves. 


Method of Calculation 

The ALWAC IIIE computer at the U.B.C. Computing Center was
programmed to solve the eight linear equations given by the boundary conditions 
for the amplitude and phase of each of the four possible reflected and refracted 
waves (Kolsky 1953). The fraction of the energy of the incident wave
transmitted to the second medium was computed for each angle of incidence $\theta$. 
The integral was then performed using a Legendre-Gauss five point
quadrature routine which approximates the transmission coefficient by a polynomial 
of ninth degree in $\theta$. The program was checked at several points using the 
reciprocity relation: 


(13)    $\left[\frac{\Gamma_l}{c_l^2} + \frac{2\Gamma_t}{c_t^2}\right]_{ij} = \left[\frac{\Gamma_l}{c_l^2} + \frac{2\Gamma_t}{c_t^2}\right]_{ji}$,


where the subscripts indicate the values of $\Gamma_l$ and $\Gamma_t$ within the bracket from 
medium i to medium j. 
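
A five point Legendre-Gauss rule of the kind mentioned above can be sketched in a few lines. This is a modern illustration, not the ALWAC routine: the nodes and weights are the standard tabulated values, `alpha` stands for any callable returning a transmission coefficient, and the function names are hypothetical.

```python
import math

# Standard nodes and weights of the five point Legendre-Gauss rule on [-1, 1].
NODES = (0.0, 0.5384693101056831, -0.5384693101056831,
         0.9061798459386640, -0.9061798459386640)
WEIGHTS = (0.5688888888888889, 0.4786286704993665, 0.4786286704993665,
           0.2369268850561891, 0.2369268850561891)

def gauss5(f, a, b):
    """Five point Legendre-Gauss estimate of the integral of f over [a, b]."""
    half = 0.5 * (b - a)
    mid = 0.5 * (b + a)
    return half * sum(w * f(mid + half * x) for x, w in zip(NODES, WEIGHTS))

def gamma_from_alpha(alpha):
    """Integral of alpha(theta)*sin(theta)*cos(theta) over 0..pi/2."""
    return gauss5(lambda th: alpha(th) * math.sin(th) * math.cos(th),
                  0.0, math.pi / 2)
```

The rule is exact for polynomials up to the ninth degree, so it works well for smooth integrands; a transmission coefficient with a sharp cutoff at a critical angle would be integrated less accurately unless the range is split at that angle.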

The contours of $\Gamma$ are given for a range of relative densities of the two 
media and the ratios of the velocities of longitudinal waves in the two media. 
In Figs. 4 and 5 Poisson's ratio $\sigma$ was chosen to be 0.33 in both media,
corresponding to $c_l/c_t$ = 2; while in Figs. 6 and 7, $\sigma$ = 0.33 for the first medium 
and 0.44 for the second. Poisson's ratio = 0.44 corresponds to $c_l/c_t$ = 3. 
The values of $\Gamma_t$ given in Figs. 5 and 7 are the values obtained after integrating 
over all possible angles of polarization of the transverse waves. 
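
The correspondence between Poisson's ratio and the velocity ratio quoted above follows from the standard relation for an isotropic elastic solid, which can be evaluated as in the short sketch below (the function name is introduced only for this example).

```python
import math

def velocity_ratio(sigma):
    """Ratio of longitudinal to transverse wave velocity in an isotropic
    elastic solid with Poisson's ratio sigma (sigma < 0.5)."""
    return math.sqrt(2.0 * (1.0 - sigma) / (1.0 - 2.0 * sigma))

ratio_first = velocity_ratio(1.0 / 3.0)    # Poisson's ratio of one third
ratio_second = velocity_ratio(0.44)
```

A Poisson's ratio of one third gives a velocity ratio of exactly 2, and 0.44 gives a ratio close to 3, in agreement with the values stated in the text.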

In Table 1 $\Gamma_l$, $\Gamma_t$, and dQ/dt are given for some common solids. These have 
been computed from the room temperature densities and acoustic velocities 
as given in the American Institute of Physics Handbook. In the case of
crystalline solids these acoustic velocities are the values averaged over the different 
crystal directions. It has been assumed that we can approximate these solids 
to isotropic solids with these average values of the room temperature acoustic 
velocities and densities. In many cases this is not a very good approximation 
but the values of $\Gamma$ still will be correct to within 10 to 20% judging from the 
variations of $\Gamma$ in Figs. 4, 5, 6, and 7. This accuracy is sufficient to allow 
these values to be used as a guide in selecting suitable materials for optimum 
heat transfer. 

FIG. 4. Contours of $\Gamma_l$ for the transmission of heat from medium 1 to 2 with respect to 
the ratio of the velocity of longitudinal waves in the two media and the relative densities. 
Poisson's ratio is equal to 0.33 in both media. 

FIG. 5. Contours of $\Gamma_t$ for the transmission of heat from medium 1 to 2 with respect to 
the ratio of the velocities of longitudinal waves in the two media. Poisson's ratio is equal to 
0.33 in both media. 

So far we have considered only the contact resistance due to the imperfect
transmission of phonons across the interface. In certain specific cases, such as
the transport of heat from a metal to a dielectric solid or from a metal to a
paramagnetic salt, there are additional factors which can contribute to the
apparent contact resistance. This resistance does not necessarily occur at the
points of contact of the two media, but experimentally one might have
difficulty distinguishing these effects from the true contact resistance discussed
above. In the case of a metal in contact with a dielectric solid one has to consider
the resistive contribution due to the finite electron-phonon relaxation time,
whereas in the case of a paramagnetic salt in contact with a metal one has, in
addition to the phonon-electron relaxation time, the spin-lattice and
phonon-phonon relaxation times. The contribution of these will be discussed in turn.

Fig. 6. Contours of $\Gamma_l$ for Poisson's ratio equal to 0.33 in the first medium and 0.44 in the
second.

Fig. 7. Contours of $\Gamma_t$ for Poisson's ratio equal to 0.33 in the first medium and 0.44 in the
second.

III. PHONON-ELECTRON CONTACT RESISTANCE

Let us calculate the rate $dQ/dt$ at which heat may be transported from the
system of phonons in a metal to the conduction electrons. Let $L(\nu)$ be the
mean free path of a phonon of energy $h\nu$ for collisions with the conduction
electrons. We will suppose that we are concerned with very low temperatures
so that the mean free path for phonon-phonon collisions is very great compared
with $L(\nu)$. In the simple theory of electron-phonon interactions there is no
coupling between the transverse phonons and the conduction electrons, so we
shall calculate the transport of heat from the longitudinal phonons to the
conduction electrons alone. This will serve to give us the order of magnitude
of this effect. Furthermore, we will assume that on the average a fraction $\alpha$
of the energy of a phonon can be transferred to the electrons at a collision.
We obtain the expression:


(14)    $\dfrac{dQ}{dt} = \alpha\, c_l \displaystyle\int \dfrac{N(\nu)\, h\nu}{L(\nu)}\, d\nu$


The mean free path in a metal may be calculated in terms of the electron-phonon
interaction and is given by Wilson (1953) as:


(15) on ( hy ) m*"aC (42)'"4 , 
- Liv) \ReO/ ROM \3 ve 


where m* = effective mass of conduction electrons,
      a = lattice constant,
      C = electron-phonon interaction energy, 1 eV < C < 10 eV,
      $\Theta$ = Debye temperature,
      M = mass of an atom of the lattice.


Putting (15) in (14) and using the Bose-Einstein distribution for $N(\nu)$ one
obtains the net transfer of heat from the phonons at temperature $T_1$ to the
electrons at temperature $T_2$:


a eocg wie init ok ee” 
(16) dt = 9Ra h?Mo* (T; 7» ) Or 3 


%00 x! 
x | ae - dx ergs/sec g-atom. 
7 a 


This expression has been evaluated for copper on the assumption that m* = m,
C = 3 eV, and $\alpha$ = 0.25:


(17) ae = 1.010" (7;°— 72°) ergs/sec/cc 


and the mean free path for the effective transfer of energy to the conduction
electrons for phonons of energy $h\nu = kT$:


(18) L = 4 X%107(1/T) cm: 


These calculations serve to illustrate two points. Firstly, it is necessary to have
an appreciable volume as well as a large surface area of a metal in contact
with a cooling salt in order to effect the maximum transfer of heat at very low
temperatures. Secondly, the technique of embedding many fine copper wires
in a paramagnetic salt to obtain good thermal contact has only limited value
at extremely low temperatures. There is no point in using extremely fine wire
in order to increase the surface area alone if the diameter of the wire becomes
comparable to the mean free path of the phonons. If the diameter is of this
order, those phonons which do penetrate the surface will have a good chance
of escaping without transferring their energy to the conduction electrons.


IV. CONTACT RESISTANCE WITH PARAMAGNETIC SALTS 

Where a paramagnetic salt is used to cool a metal, in addition to the contact
resistance at the surface and that due to the finite phonon-electron relaxation
time, there are other resistive elements due to the spin-lattice and the
phonon-phonon relaxation times. It appears to be generally accepted now that in
most paramagnetic salts the spin-phonon relaxation times are much shorter
than the phonon-phonon relaxation times (Strandberg 1958), with the result
that the usual equilibrium distribution of the phonons is distorted in the
region of phonon energies close to the spin energy.

This has a very important effect upon the transport of heat between the
spin system and a metal. This can be shown quite simply by making the
approximation that those phonons lying in the frequency range $\nu_1$ to $\nu_2$ at the
electron spin frequency are so strongly coupled to the spin system that their
distribution between these frequency limits corresponds to that of the spin
temperature, while those phonons outside this range are completely decoupled
from the spin system and have a distribution corresponding to the lattice
temperature. If this salt is in contact with a metal at a temperature different
from the spin temperature, those phonons outside the range $\nu_1$ to $\nu_2$ will rapidly
come to the temperature of the metal because of the small heat capacity of the
lattice at these temperatures, while those phonons inside the range $\nu_1$ to $\nu_2$
will remain at the spin temperature and will be the only means by which energy
can be transferred between the spin system and the metal. As a consequence
of this the thermal conductivity will be lower than that obtained by assuming
an equilibrium phonon distribution, and the contact resistance higher. In this
case we must use the limits $h\nu_1/kT$ and $h\nu_2/kT$ for the integral occurring in
(7). We then obtain for the heat flux at the salt-metal interface:


& ’ TP 
(19) Ze = 5.0110" (2) 1s Af{ T1f(T,:)—- T3‘f(T2)} ergs /second 


1 Cr 


where

(20)    $f(T) = \displaystyle\int_{h\nu_1/kT}^{h\nu_2/kT} \dfrac{x^3\, dx}{e^x - 1}$.

There are several cases of interest in which one can approximate to the value 
of the integral. 


(a) High Temperature Approximation 
If the temperature is high enough for both $h\nu_1/kT$ and $h\nu_2/kT$ to be much
less than one, and low enough to satisfy the conditions for a non-equilibrium
phonon distribution, the exponential may be expanded and the integral becomes
$\int x^2\, dx$.

Hence, 


; a 16 a | a] 1 a] — : 
? —= =! — —gea ee i — ee mao 
(21) i 5.01 X10 - cat ce 13 \e (v) —v2)(T1—Te2). 


The heat flux is now much less than that given by (8) and $dQ/dt$ is linearly
dependent upon the temperature difference across the interface.


(b) Low Temperature Approximation

If $\nu_1 = 0$ and $h\nu_2/kT \gg 1$ we may use the limits 0 and $\infty$ for x in the
integral and obtain the same relation as (8) at very low temperatures. In
the region lying between the low and high temperature approximations one
should expect the temperature dependence to vary from the fourth power at
low temperatures as given by (8) to the linear dependence of (21) at higher
temperatures.
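
The two limiting cases above can be illustrated numerically. The following is a minimal sketch, assuming the band integral has the integrand $x^3/(e^x - 1)$ taken between the limits $h\nu_1/kT$ and $h\nu_2/kT$; the function names and the particular limits are illustrative choices, not values from the text.

```python
import math

def planck_integrand(x):
    """x**3 / (exp(x) - 1), with the x -> 0 limit handled explicitly."""
    if x == 0.0:
        return 0.0
    return x**3 / math.expm1(x)

def band_integral(x1, x2, steps=2000):
    """Composite Simpson estimate of the integral of x^3/(e^x - 1) from x1 to x2."""
    if steps % 2:
        steps += 1
    h = (x2 - x1) / steps
    total = planck_integrand(x1) + planck_integrand(x2)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * planck_integrand(x1 + i * h)
    return total * h / 3.0

# Low-temperature case: limits 0 and "infinity" (60 is used as a stand-in).
full = band_integral(0.0, 60.0, steps=6000)
print(full, math.pi**4 / 15)

# High-temperature case: both limits much less than one, so the integrand
# is close to x**2 and the integral is close to (x2**3 - x1**3) / 3.
narrow = band_integral(0.01, 0.02)
print(narrow, (0.02**3 - 0.01**3) / 3)
```

With the full range the integral approaches the familiar constant $\pi^4/15$, which is why the fourth-power law is recovered; with a narrow band at small x it depends only on the band limits, which is the origin of the weaker temperature dependence.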
(c) High Field Approximation 

If a large magnetic field is applied to the salt, $\nu_1$ and $\nu_2$ will lie close to the
electron Larmor precession frequency $\nu$, and we will presume that the width
$|\nu_1 - \nu_2| = \Delta\nu$ will be much less than $\nu$. Then $e^x$ will be large compared to 1
in the denominator of the integral, which may then be approximated to give:


dQ E 16 a ey aa] ey (=) —hy (kT hv /kT 
g2) Sh = 5¢ =) ) +S | (2) (2 ang BA, 
(22) at 9.01 K 10 J Lette? b : fe e€ } 


In this case we should expect a strongly field and temperature-dependent 
contact resistance. 

It should be noted in the foregoing that the thermal conductivity for the
paramagnetic salt will also be affected in the same way for each of the above
cases. In particular, the thermal conductivity due to those phonons which
interact with the spins will be much less than that due to the bulk of the
phonons because, firstly, there are fewer phonons able to carry the heat
from the spins and, secondly, those that are able are severely scattered by
the spins themselves. Consequently the diffusion of the spin energy to
the surface is much slower than one would expect from the gross thermal
conductivity.

At very low temperatures one other limitation to the transfer of heat to or 
from the spin system might be the true spin-lattice relaxation time. One may 
optimize this heat transfer by the use of ions with strong spin-orbit coupling 
such as Crtt, Tit, and Cut? and avoid those ions in an S-state, such as Fe+++ 
and Mn***, 

V. CONTRIBUTION OF SURFACE WAVES 
In general one should not expect the surface or Rayleigh waves to play an
important role in the transfer of heat from one solid to another. In the first
place the total heat flux round the edges of the solid due to these surface waves
is much smaller than through the body of the solid. This is because the waves
are localized within about a wavelength of the surface and all the heat must
be transmitted through this region. The phonons or body waves are not limited
in this way and, consequently, they can carry a far greater heat flux. In the
second place the surface waves may be excited by the phonons themselves.
However, the two additional interactions which are necessary to transfer
the heat energy to and from the surface waves must introduce a further
resistive contribution in addition to the boundary conditions which must be
satisfied by the surface waves themselves. There are, however, a few exceptional
cases where the surface waves may play a more important role. These are
where: (i) the surface waves may interact directly with the conduction electrons
or with the surface energy states in a metal; or (ii) the acoustic
velocities in the two media are so different that almost all the incident phonons
are totally reflected. The totally reflected waves give rise to a surface
disturbance in the second medium which can interact directly with the phonons
or with the conduction electrons. Because of the small coefficient of
transmission of the phonons and the large density of surface waves in this case, the
role of each could be expected to be reversed and the latter could then become
the dominant transfer mechanism.

Further investigation is required to determine when these waves do, in fact, 
become the dominant transfer mechanism. 

DISCUSSION 

From the foregoing analysis one can draw the following conclusions regarding
the contact resistance.

(a) In general the transfer of heat between two dissimilar materials at low
temperatures should not be a more serious problem than the thermal
conductivity, because the values of $\Gamma$ seldom are much smaller than 0.2 for common
solids. The surface resistance in this case would be equivalent to a little more
than the thermal resistance of a thickness of the first medium equal to two
phonon mean free paths.

(b) Exceptions to this will occur in the following cases where the contact
resistance will be abnormally high:

(i) for materials in imperfect contact where the effective area of the bodies
in contact is small;

(ii) between media in which there is a large difference in the densities
or acoustic velocities where the $\Gamma$ are small (see Figs. 4, 5, 6, and 7);

(iii) between the spin system of a paramagnetic salt and a solid in contact
with it where only a small fraction of the available phonons are able to carry
the energy from the spin system. This should be particularly pronounced in
the presence of a strong magnetic field.

These conclusions can be compared with the available experimental results.
The contact resistance from liquid helium to copper has been measured by
several workers. Ambler (1953) obtained the expression
$dQ/dt = 3.4 \times 10^5 (T_1^4 - T_2^4)$ ergs/sec cm². This is the expected temperature
dependence but numerically far greater than the calculated value given in Table I.
Clearly the transmission of phonons across the interface alone is quite
insufficient to account for the experimental results. However, in this case one
should expect the surface waves to be of importance because the ratio of the
acoustic velocities is greater than 10. In this particular case we have calculated
the amplitude of the surface disturbance caused by totally reflected phonons by
solving the boundary conditions as described earlier. It was found that the
experimental results can then be explained if about 10% of the energy in this
disturbance is transferred to the metal. Such a strong interaction suggests
that the surface disturbance interacts directly with the conduction electrons.
If this view is correct, there should be an appreciable difference in the contact
resistance of a metal in the normal and in the superconducting state.
Furthermore, if the surface of the metal is covered with an oxide layer or a layer
of dirt, only the longer wavelength phonons would be able to penetrate to the
conduction electrons. This would give rise to a heat flux proportional to the
difference of the third powers of the temperatures of the metal and the helium,
which would explain the results obtained by Fairbank and Wilks (1955).

TABLE I

From         To          $\Gamma_l$     $\Gamma_t$     $dQ/dt$ (ergs/sec cm²)
KCr alum     …           0.14     0.19     … $(T_1^4 - T_2^4)$
KCr alum     …ilv…       0.26     0.25     …
KCr alum     …           0.17     0.21     …
Pyrex        Copper      0.34     0.36     …
Lucite       Copper      0.11     0.02     …
Garnet       Copper      0.46     0.45     …
Copper       Helium      0.001    0.0008   …

It is well known that the transfer of heat between bodies merely pressed 
into contact with one another is extremely small at low temperatures. More- 
over, Berman (1956) found the temperature dependence of this heat flux to be 
proportional to 7° at helium temperatures. His results are in agreement with 
the temperature dependence and magnitude predicted for a pair of surfaces 
in poor contact treated in Section [. 

The results of Section IV provide an explanation of the measurements of
Goodman (1953) and, more recently, of Miedema, Postma, and Steenland
(1958) of the contact resistance from copper to a paramagnetic salt. Goodman
measured this contact resistance and found $dQ/dt$ proportional to the square
of the absolute temperature. The more recent measurements of Miedema et al.
on a single crystal of potassium chrome alum cemented to a copper strip show
that $dQ/dt$ is proportional to $(T_1 - T_2)$ at high temperatures and $(T_1^4 - T_2^4)$
at low temperatures. This agrees with the predicted temperature dependence
obtained on the assumption of a non-equilibrium phonon spectrum. At low
temperatures the numerical value they obtained was
$dQ/dt = 1.2 \times 10^5 (T_1^4 - T_2^4)$ ergs/sec cm². The calculated value in Table I
is about three times greater than this. This is a very satisfactory agreement
because the crystal will probably be strained close to the copper surface due to
differential contraction, and this will further impede the flow of heat.
SOLID HARMONICS AS BASIS FUNCTIONS FOR
CUBIC CRYSTALS¹


D. D. Betts²


ABSTRACT 

The various sets of basis functions useful in discussing cubic crystals must
include sets of symmetrized combinations of powers of the co-ordinates
orthogonalized over the cellular polyhedron. Such polynomials are here called solid
harmonics. A study of the actual solid harmonics reveals the limitations of the
spherical cell approximation. The solid harmonics can be used to develop a new
method of integration over the cellular polyhedron of the body-centered cubic
lattice or of the face-centered cubic lattice.


1. INTRODUCTION


Phenomena in solid state physics often require for their description functions 
of several variables having the symmetry of a crystal lattice, or a symmetry 
determined by that of the lattice. Thus it is desirable to have available com- 
plete sets of simple basis functions having the appropriate translational and 
rotational symmetry. Often, however, one considers functions defined through- 
out only one cellular polyhedron surrounding a lattice point in configuration 
space (or throughout a Brillouin zone in reciprocal space). In this paper we
present sets of basis functions appropriate for the expansion of functions having
rotational symmetry corresponding to the various irreducible representations
of the complete cubic group and defined throughout the cellular polyhedra
appropriate to the simple cubic, face-centered cubic, and body-centered cubic
lattices respectively. We call such functions solid harmonics.

In Section 2 the various types of basis functions commonly used are dis- 
cussed. In Section 3 the notation and method of computation of the new 
solid harmonics are discussed and tables of these functions are displayed. In 
Section 4 is discussed the adequacy in view of the results of Section 3 of the 
common approximation of replacing a cellular polyhedron by a sphere of 
equal volume. In Section 5 various applications of the new harmonics are 
mentioned, and application to the problem of integration of functions of 
cubic symmetry is discussed in detail. 

One of the main aims of this paper is to show that the usual procedure 
of factoring mathematical functions occurring in solid state physics into 
radially and angularly dependent parts has limitations not usually mentioned. 

Throughout this paper we shall be concerned mainly with functions associ- 
ated with cubic lattices. 


¹Manuscript received October 20, 1958.
Contribution from the Division of Pure Physics, National Research Council, Ottawa, and
the Department of Physics, University of Alberta, Edmonton.
Issued as N.R.C. No. 5083.
²Permanent address: Department of Physics, University of Alberta, Edmonton, Alberta.

Can. J. Phys. Vol. 37 (1959)


2. TYPES OF BASIS FUNCTIONS

Often we wish to have a set of basis functions having both the periodicity
of the lattice and a rotation symmetry about a lattice point determined by
the rotational symmetry of the lattice. In group theoretic language such
functions are said to belong to a representation of the space group of the
lattice, a direct product of a translation group and a rotation group. In order
to ensure completeness of the set the basis functions are usually chosen
as the eigenfunctions of some differential operator. The commonest and
simplest such functions are the plane waves, linear combinations of functions
of the form $e^{i\mathbf{k}\cdot\mathbf{r}}$ having a specified symmetry. The plane waves are solutions
of the equation


(1)    $(\nabla^2 + k^2)\,u(\mathbf{r}) = 0.$


Consider, for example, a simple cubic lattice of period 2 with a lattice point 
at the origin of a Cartesian co-ordinate system with axes parallel to the basis 
vectors of the lattice. The functions 


$$\begin{aligned}
W_{lmn} ={}& \cos l\pi x \cos m\pi y \cos n\pi z + \cos m\pi x \cos n\pi y \cos l\pi z \\
&+ \cos n\pi x \cos l\pi y \cos m\pi z + \cos l\pi x \cos n\pi y \cos m\pi z \\
&+ \cos n\pi x \cos m\pi y \cos l\pi z + \cos m\pi x \cos l\pi y \cos n\pi z, \qquad l, m, n = 0, 1, 2, \ldots
\end{aligned}$$


are plane waves which obviously have the periodicity of the lattice. The $W_{lmn}$
are also invariant under all operations which leave the cube bounded by
x = ±1, y = ±1, and z = ±1 invariant. In other words the $W_{lmn}$ transform
according to that one of the 10 irreducible representations of the cubic rotation
group (Bouckaert, Smoluchowski, and Wigner 1936) labelled s in the
spectroscopic notation used by Howarth and Jones (1952). If now a second simple
cubic lattice is added to the first in such a way as to produce a body-centered
lattice, the complete set of plane waves corresponding to the s representation
consists of those of the above $W_{lmn}$ for which l + m + n is even. It is not
difficult to write down in this way the plane waves corresponding to any
of the irreducible representations of the space group of any lattice. However,
it is not our purpose to exhibit lists of these functions here.
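
The two symmetry properties claimed for these symmetrized plane waves can be spot-checked numerically. The sketch below is only an illustration: the function name `W`, the indices, and the test point are arbitrary choices, and the function is built as the sum over the six permutations of (l, m, n) described above.

```python
import itertools
import math

def W(l, m, n, x, y, z):
    """Symmetrized plane wave: sum of cos(a*pi*x) cos(b*pi*y) cos(c*pi*z)
    over the six permutations (a, b, c) of (l, m, n)."""
    return sum(
        math.cos(a * math.pi * x) * math.cos(b * math.pi * y) * math.cos(c * math.pi * z)
        for a, b, c in itertools.permutations((l, m, n))
    )

l, m, n = 1, 2, 3                      # arbitrary illustrative indices
point = (0.13, -0.41, 0.27)            # arbitrary illustrative point
reference = W(l, m, n, *point)

# Lattice periodicity: shifts by multiples of the period 2 along the axes.
shifted = W(l, m, n, point[0] + 2, point[1], point[2] - 4)

# Cubic symmetry: every permutation of the co-ordinates combined with
# every change of sign (48 operations in all).
images = [
    W(l, m, n, *(s * c for s, c in zip(signs, perm)))
    for perm in itertools.permutations(point)
    for signs in itertools.product((1, -1), repeat=3)
]

print(abs(shifted - reference))
print(max(abs(v - reference) for v in images))
```

Both printed differences should be at the level of rounding error, since each cosine is even and periodic and the sum runs over all assignments of the indices to the axes.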

Another set of possible basis functions are the Mathieu functions which 
are solutions of the more complicated equation 


$[\nabla^2 + k^2 + V(\mathbf{r})]\,u(\mathbf{r}) = 0$


where V(r) has a sinusoidal variation such that its minima occur at lattice 
points. Mathieu functions have been discussed extensively by Slater (1953). 
For a detailed discussion on wave propagation in periodic structures see 
Brillouin (1946). For a review of methods in the one-electron theory of solids 
using various types of wave functions, see Reitz (1955). 

As opposed to functions which are continuous throughout a lattice, such
as plane waves, there are functions in the theory of solids which have a
"band" structure and are continuous only throughout a unit polyhedron of
the lattice. The best known examples of such functions are the energies, E,
as a function of the momentum vector k, in the one-electron theory of solids.
Another such function is the angular frequency $\omega$ of the normal modes of
lattice vibration of solids, again as a function of wave vector k. (See for
example Born and Huang (1954).) Both E(k) and $\omega$(k) are in general found
to be smoothly varying functions of k throughout a Brillouin zone.

As it is often desirable to expand such functions in a complete infinite set
of basis functions (perhaps keeping as an approximation only a finite number
of terms), the question arises as to what is the most appropriate basis set. The
symmetrized plane waves already are orthogonalized over a cellular
polyhedron, giving them an important advantage. However, the simplest set of
linearly independent functions having rotational symmetry consists of the
combinations of powers of $k_x$, $k_y$, and $k_z$ (or x, y, and z) as long as translational
symmetry is not required. Such functions are linearly independent but not
orthogonal to one another throughout a cellular polyhedron. Apparently the
only way to construct an orthogonal set of such functions is through the
well-known Schmidt orthogonalization process. We shall call sets of
polynomials orthogonal over three-dimensional solids solid harmonics. The solid
harmonics can be regarded as those solutions of the differential equation
$\nabla^2 u = 0$ which are regular at the origin.
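
As an illustration of the Schmidt process in the simplest setting, the sketch below orthogonalizes the s-type polynomials 1, r², r⁴ over the unit sphere in exact arithmetic. It assumes the volume average of $r^n$ over the unit sphere is 3/(n + 3); the helper names are hypothetical, and this is a simplified one-variable stand-in for the three-dimensional calculation, not the procedure used for the tables.

```python
from fractions import Fraction

def sphere_moment(n):
    """Volume average of r**n over the unit sphere, 3/(n + 3)."""
    return Fraction(3, n + 3)

def inner(p, q):
    """Volume-averaged inner product of two polynomials in r,
    each stored as a dict {power: coefficient}."""
    return sum(a * b * sphere_moment(i + j)
               for i, a in p.items() for j, b in q.items())

def gram_schmidt(powers):
    """Schmidt-orthogonalize the monomials r**power in the order given."""
    basis = []
    for power in powers:
        monomial = {power: Fraction(1)}
        p = dict(monomial)
        for b in basis:
            c = inner(monomial, b) / inner(b, b)   # projection coefficient
            for k, v in b.items():
                p[k] = p.get(k, Fraction(0)) - c * v
        basis.append(p)
    return basis

basis = gram_schmidt([0, 2, 4])
print(basis[1])   # r^2 - 3/5
print(basis[2])   # r^4 - (10/9) r^2 + 5/21
```

The results, r² − 0.600 and r⁴ − 1.111 r² + 0.238 in decimals, agree with the unnormalized spherical s-type harmonics quoted in Section 4.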


3. CONSTRUCTION OF SOLID CUBIC HARMONICS 


The remainder of this paper will be concerned with the three monatomic
lattices, simple cubic, body-centered cubic, and face-centered cubic. Special
attention will be devoted to the latter two because only they are realized in
actual crystals. The cellular polyhedra for the three lattices mentioned are
a cube, a truncated cube or truncated octahedron, and a rhombohedral
dodecahedron respectively. The latter two figures are illustrated, for example,
by Reitz (1955).

The problem is to pick out those combinations of the powers of x, y, and z
which transform according to each of the 10 irreducible representations of
the cubic (or octahedral) group $O_h$ and then to use the Schmidt process to
make the linearly independent polynomials orthogonal to one another over
each of the above solids and, for comparison, a sphere.

Actually, for the cube it is unnecessary to use the Schmidt process because
polynomials of the form


$P_l(x)\,P_m(y)\,P_n(z), \qquad l, m, n = 0, 1, 2, \ldots$


where $P_l(x)$ is the Legendre polynomial of degree l, are orthogonal to one
another already over the cube bounded by x = ±1, y = ±1, z = ±1. Thus
the complete orthogonal set of polynomials in x, y, z over the cube belonging
to representation s (invariant under any permutation or change of sign of
x, y, and z) are of the form


$$\begin{aligned}
(3)\qquad C_s^{lmn}(x, y, z) ={}& P_{2l}(x)P_{2m}(y)P_{2n}(z) + P_{2m}(x)P_{2n}(y)P_{2l}(z) \\
&+ P_{2n}(x)P_{2l}(y)P_{2m}(z) + P_{2l}(x)P_{2n}(y)P_{2m}(z) + P_{2n}(x)P_{2m}(y)P_{2l}(z) \\
&+ P_{2m}(x)P_{2l}(y)P_{2n}(z), \qquad l, m, n = 0, 1, 2, \ldots
\end{aligned}$$

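
The orthogonality of such products over the cube follows from the one-dimensional orthogonality of the Legendre polynomials on [−1, 1], because the triple integral factorizes into three single integrals. A minimal exact check of the one-dimensional statement is sketched below; the helper names are hypothetical and the polynomials are generated from Bonnet's recurrence.

```python
from fractions import Fraction

def legendre(n):
    """Coefficients (lowest power first) of the Legendre polynomial P_n, from
    Bonnet's recurrence (k+1) P_{k+1} = (2k+1) x P_k - k P_{k-1}."""
    p_prev, p = [Fraction(1)], [Fraction(0), Fraction(1)]
    if n == 0:
        return p_prev
    for k in range(1, n):
        x_p = [Fraction(0)] + p                                # x * P_k
        padded_prev = p_prev + [Fraction(0)] * (len(x_p) - len(p_prev))
        p_prev, p = p, [
            (Fraction(2 * k + 1) * a - k * b) / (k + 1)
            for a, b in zip(x_p, padded_prev)
        ]
    return p

def integrate_product(p, q):
    """Exact integral over [-1, 1] of the product of two polynomials."""
    total = Fraction(0)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            if (i + j) % 2 == 0:                               # odd powers vanish
                total += a * b * Fraction(2, i + j + 1)
    return total

print(integrate_product(legendre(2), legendre(4)))   # distinct degrees
print(integrate_product(legendre(3), legendre(3)))   # same degree: 2/(2*3 + 1)
```

Since the integral over the cube of a product of two such triple products is the product of three integrals of this kind, it vanishes unless all three pairs of degrees coincide.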


The general expressions for the solid cubic harmonics belonging to the other 
representations are also easily written down. Picking out the combinations 
of x, y, and z having the symmetry appropriate to a given representation 
is accomplished with the aid of the group character tables of Bouckaert,
Smoluchowski, and Wigner (1936) or more directly with the aid of the
polynomials of Von der Lage and Bethe (1947). It must be remembered that
the x, y, and z of the latter authors are really the direction cosines x/r, y/r,
and z/r in our notation. For example, 1, $x^2+y^2+z^2 = r^2$, $x^4+y^4+z^4$, $r^4$ are
linearly independent polynomials belonging to the s representation, while
x, y, z, $x^3$, $y^3$, $z^3$, $r^2x$, $r^2y$, $r^2z$ belong to the p representation.

Performing the integration necessary to obtain an orthogonal set is elemen- 
tary and straightforward but, especially in the case of the truncated cube or 
truncube, extremely tedious. All the integrals needed in order to construct 
the orthonormal sets of all truncubic, dodecahedral, and spherical cubic 
harmonics of degree less than five are listed in the Appendix. 

It is important to have an adequate notation for the solid harmonics. The 
kernel or principal letter of the symbol indicates the region over which the 
harmonics are orthogonal, C, S, T, and D corresponding respectively to cube,
sphere, truncube, and dodecahedron. The representation (giving the trans- 
formation properties of the harmonics) is indicated by a spectroscopic sub- 
script. The harmonics are further distinguished by two numerical superscripts 
whose sum indicates the degree of the harmonic. Thus 7%" indicates one of 
the harmonics of degree three with one nodal plane through the origin nor- 
malized and orthogonal to all other p type harmonics over the unit truncube. 
The order in which the independent polynomials are taken to form successive
harmonics of the same representation is determined first by degree; then an
arbitrary but consistent choice is made among those of the same degree. All
solid harmonics of degree less than five are listed in Table I.

It should be noticed that the solid spherical harmonics St?” orthogonal 
over the unit sphere are just products of Von der Lage and Bethe’s (1947) 
Kubic Harmonics and simple polynomials in r. 


4. ADEQUACY OF THE APPROXIMATION OF SPHERICAL CELLULAR 
POLYHEDRA 

In the cellular method of Wigner and Seitz (1933) in the one-electron
theory of solids the approximation is made of treating the cellular polyhedra
as spheres. Only in a "spherical cellular polyhedron" can the one-electron
Schrödinger equation be separated into radially and angularly dependent
equations. If the usual assumption is made that the potential depends on r
only, then the solutions of the angularly dependent equation are the ordinary
surface spherical harmonics, $Y_{lm}(\theta, \phi)$. The approximation of a spherical cell
becomes more realistic if the combinations of the $Y_{lm}$ used transform according
to one of the irreducible representations of the cubic or octahedral group
$O_h$. Such combinations are the Kubic Harmonics of Von der Lage and Bethe
(1947). In one sense the same approximation has been used by Houston (1948)
in developing his method for determining the specific heat of lattices from the
frequency distribution of the normal modes of the lattice vibrations. It must
be recognized that various authors such as Horton and Schiff (1958) and
Howarth and Jones (1952) have compensated in a reasonable way for the
assumption of a spherical cell.

An examination of the integrals in the Appendix shows that the average
values of the even powers of r over the unit sphere, $\langle r^{2n}\rangle_s$, are very nearly
equal to the average values over the unit truncube, $\langle r^{2n}\rangle_t$. The greatest
difference, amounting to 4%, is found for (7°). We take this as some evidence 
that for functions of r only the approximation of replacing the cellular poly- 
hedron of a body-centered lattice (or the Brillouin zone of a face-centered 
lattice) by a sphere is not a bad approximation. If we examine the table of
harmonics, we notice that the corresponding coefficients of the $S_s$ and the $T_s$
are quite similar, as indicated by a comparison of the unnormalized harmonics
below.


    $S_s \sim r^2 - .600$                      $T_s \sim r^2 - .504$
    $S_s \sim r^4 - 1.111\,r^2 + .238$         $T_s \sim r^4 - 1.085\,r^2 + .216$
    $S_s \sim x^4+y^4+z^4 - .600\,r^4$         $T_s \sim x^4+y^4+z^4 - .663\,r^4 - .064\,r^2 + .042$

Also the $T_{d^1}$ and $S_{d^1}$ are rather similar.

    $S_{d^1} \sim r^2 - .778$                  $T_{d^1} \sim r^2 - .821$
    $S_{d^1} \sim x^2+y^2 - .857\,r^2$         $T_{d^1} \sim x^2+y^2 - .834\,r^2 - .039$


Thus we tentatively propose, on the basis of fairly meagre evidence, that the
spherical cell may be a fairly good approximation to the cellular polyhedron
of the body-centered cubic lattice for s and $d^1$ type functions. For p and $d^2$
type functions, however, Table I seems to indicate that the spherical
approximation is inadequate. Also in the face-centered lattice the spherical
approximation seems to be poor for functions of any of the types s, p, $d^1$, and $d^2$. No
conclusions can be drawn about functions of the other six types. Nevertheless,
it is often expedient to use the spherical cell just because of the great
computational complexity involved in solving non-separable three-dimensional
Schrödinger equations.


5. APPLICATIONS 
The solid harmonics should be useful as basis functions for expansion of
smoothly varying functions of cubic symmetry in cellular polyhedra or
Brillouin zones of cubic lattices. However, the simple unorthogonalized
polynomials are adequate in many cases, such as in the expansion of the
one-electron energy, E(k), as


(4)    $E(\mathbf{k}) = E_0 + a k^2 + b k^4 + c\,(k_x^4 + k_y^4 + k_z^4) + \ldots$


Only when integration of the expanded function is to be performed is it more 
useful to expand in terms of the orthogonal solid harmonics. 

In the theory of specific heats of lattices, for example, it is necessary to
integrate over the cellular polyhedron a function of k in general so
complicated that one can obtain analytic expressions for it only for certain special
sets of values of k, typically $k_x = 0$ or $k_x = k_y$. Houston (1948) took advantage
of this fact in developing his method, which samples the integrand in the
three directions $k_y = k_z = 0$; $k_x = k_y$ and $k_z = 0$; and $k_x = k_y = k_z$, and
obtains the angular dependence of the integrand by fitting a linear
combination of the first three Kubic Harmonics. Houston's method suffers from
the disadvantage that in converting the variable of integration from k to $\omega$
infinities are artificially introduced. This difficulty is avoided by Horton and
Schiff (1958), but they still use the Kubic Harmonics as interpolative functions.

Fig. 1. (a) The polyhedron containing all the symmetrically non-equivalent points in the
rhombohedral dodecahedron.
(b) The polyhedron containing all the symmetrically non-equivalent points of the
truncated cube.

We propose that a use of the appropriate solid harmonics for such an
interpolative scheme is ultimately capable of the greatest accuracy. The method
can be described in general terms but we prefer to illustrate it with a concrete
example. In Fig. 1(a) is illustrated the 1/48th of the rhombohedral
dodecahedron containing all the non-equivalent points. Following Bouckaert,
Smoluchowski, and Wigner (1936) we have labelled the origin and the other
corner points $\Gamma$, H, N, and P respectively. The scale of our figure is
conveniently fixed by letting H have co-ordinates (1, 0, 0); then N is (1/2, 1/2, 0) and
P is (1/2, 1/2, 1/2). Instead of the completely orthogonal set of solid harmonics of
Table I we introduce the s-type harmonics


$$\begin{aligned}
d^0 &= 1 \\
d^1 &= x^2+y^2+z^2 - 3/8 \\
d^2 &= x^4+y^4+z^4 - 9/80 \\
(5)\qquad d^{11} &= x^2y^2+y^2z^2+z^2x^2 - 7/240 \\
d^{lmn} &= x^{2l}y^{2m}z^{2n} + x^{2m}y^{2n}z^{2l} + x^{2n}y^{2l}z^{2m} + x^{2l}y^{2n}z^{2m} + x^{2n}y^{2m}z^{2l} \\
&\quad + x^{2m}y^{2l}z^{2n} + a^{lmn}
\end{aligned}$$

The constant $a^{lmn}$ is chosen to make the harmonic $d^{lmn}$ orthogonal to $d^0$. The
integrals in the Appendix are sufficient to enable us to write down immediately
the first 11 harmonics of this kind.
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Now let f(x, y,z) be an s-type function which it is required to integrate 
over the rhombohedral dodecahedron. The expansion 


(6) S(x, 9,8) = x Bima d'™" (x, ¥, 2) 


is exact if the whole infinite set of d’s is used. The expansion is of a certain 
degree of approximation k if all harmonics of degree 2k or less are used. 


Now 


(7) nea ~te oe 


dodecahedron dodecahedron 


= bp X volume of dodecahedron because of the orthogonal 
property of the d’s. 


The ‘‘second approximation”’ to bo is found by solving the four equations 
derived by evaluating the quantities in eq. 6 at the four special corner points 
I, H, N, and P and keeping only the first four harmonics d°, d', d!!, and d?. 
The result is 


(8) by = (71fp+%fu+336fy —56fp) /360. 
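
The weights in eq. (8) can be recovered by solving the four-point system in exact rational arithmetic. The following is a minimal sketch, assuming the corner points Γ = (0, 0, 0), H = (1, 0, 0), N = (1/2, 1/2, 0), P = (1/2, 1/2, 1/2) and the four harmonics of eq. (5); the function names `harmonics`, `solve`, and `b0_weights` are illustrative and do not come from the paper.

```python
from fractions import Fraction as F

# Corner points of the 1/48th wedge, scaled so that H = (1, 0, 0).
POINTS = {
    "Gamma": (F(0), F(0), F(0)),
    "H": (F(1), F(0), F(0)),
    "N": (F(1, 2), F(1, 2), F(0)),
    "P": (F(1, 2), F(1, 2), F(1, 2)),
}


def harmonics(x, y, z):
    """Values of d^0, d^1, d^2, d^11 from eq. (5) at one point."""
    return [
        F(1),
        x**2 + y**2 + z**2 - F(3, 8),
        x**4 + y**4 + z**4 - F(9, 80),
        x**2 * y**2 + y**2 * z**2 + z**2 * x**2 - F(7, 240),
    ]


def solve(matrix, rhs):
    """Gauss-Jordan elimination with exact fractions (square system)."""
    n = len(rhs)
    aug = [list(row) + [r] for row, r in zip(matrix, rhs)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        aug[col] = [v / aug[col][col] for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    return [row[n] for row in aug]


def b0_weights():
    """Weights w such that b_0 = sum over points of w[p] * f(p).

    They satisfy sum_p w[p] * d^j(p) = 1 for j = 0 and 0 for the
    other three harmonics.
    """
    names = list(POINTS)
    values = [harmonics(*POINTS[name]) for name in names]  # values[p][j]
    transposed = [[values[p][j] for p in range(4)] for j in range(4)]
    weights = solve(transposed, [F(1), F(0), F(0), F(0)])
    return dict(zip(names, weights))


if __name__ == "__main__":
    for name, weight in b0_weights().items():
        print(name, weight * 360)
```

Multiplying the returned weights by 360 gives the integer coefficients of eq. (8); the same routine extends to more points and harmonics by enlarging `POINTS` and `harmonics` together.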


For the truncube we can define harmonics $t^{lmn}$ analogous to (5). The
integrals tabulated in the Appendix allow us to write down the first 11
immediately, of which the first 4 are

$$
\begin{aligned}
t^{0} &= 1 \\
t^{1} &= x^{2}+y^{2}+z^{2}-19/32 \\
t^{2} &= x^{4}+y^{4}+z^{4}-89/320 \\
t^{11} &= x^{2}y^{2}+y^{2}z^{2}+z^{2}x^{2}-71/960.
\end{aligned}
\tag{9}
$$

A second degree approximation gives

$$
b_{0} = (3659 f_{\Gamma} + 5037 f_{X} - 3840 f_{K} + 13{,}864 f_{L})/18{,}720 \tag{10}
$$

for the truncube.
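
A formula of this kind can be checked by confirming that its weights give 1 for the constant harmonic and 0 for each of the other three. The sketch below does this for eq. (10) under an explicit assumption: the four sample points are taken to be Γ, X = (1, 0, 0), K = (3/4, 3/4, 0), and L = (1/2, 1/2, 1/2) of the truncated cube. This identification is inferred from the weights themselves rather than quoted from the text, and the names `t_harmonics` and `weighted_sums` are illustrative.

```python
from fractions import Fraction as F

# ASSUMPTION: sample points Gamma, X, K, L with X scaled to (1, 0, 0);
# the identification is inferred, not taken from the paper.
POINTS = {
    "Gamma": (F(0), F(0), F(0)),
    "X": (F(1), F(0), F(0)),
    "K": (F(3, 4), F(3, 4), F(0)),
    "L": (F(1, 2), F(1, 2), F(1, 2)),
}

# Weights of eq. (10).
WEIGHTS = {
    "Gamma": F(3659, 18720),
    "X": F(5037, 18720),
    "K": F(-3840, 18720),
    "L": F(13864, 18720),
}


def t_harmonics(x, y, z):
    """Values of t^0, t^1, t^2, t^11 from eq. (9) at one point."""
    return [
        F(1),
        x**2 + y**2 + z**2 - F(19, 32),
        x**4 + y**4 + z**4 - F(89, 320),
        x**2 * y**2 + y**2 * z**2 + z**2 * x**2 - F(71, 960),
    ]


def weighted_sums():
    """Sum over the points of weight * t^j for j = 0..3."""
    sums = [F(0)] * 4
    for name, point in POINTS.items():
        values = t_harmonics(*point)
        sums = [s + WEIGHTS[name] * v for s, v in zip(sums, values)]
    return sums


if __name__ == "__main__":
    print(weighted_sums())
```

With the assumed points the four sums come out as 1, 0, 0, 0 exactly, which is the condition used to derive the weights.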

The integration formulas given above which sample the integrand only at 
four points are obviously quite inadequate for all but the smoothest functions. 
However, the extension of formulas like (8) and (10) to higher degrees of 
approximation is straightforward and should become quite accurate for a 
reasonably small number of points. Houston’s method has been found quite 
adequate in most cases of practical interest by Betts, Bhatia, and Wyman 
(1956) when extended to include six points. By analogy, for a three-dimensional
rather than a two-dimensional region of integration, of the order of 15 or 20
points would perhaps be quite adequate. Further investigation of such formulas
will be carried out.

It appears that the same sort of formulas as (8) and (10) could be obtained 
using plane waves instead of solid harmonics. There are two objections to 
such a proposal. First, one would expect that an expansion of a smoothly
varying function such as ω in solid harmonics would converge more rapidly
than an expansion in plane waves just as in one dimension we normally 
expand smoothly varying functions in power series rather than a more slowly 
converging Fourier series. Second, it turns out that it is very difficult to make 
a reasonable choice of points for integrand evaluation such that the equations 
like those derived from (6) are all linearly independent. 

No obvious application of solid harmonics can be made to the problem 
of determining crystal wave functions because of the great complexity of 
non-separable three-dimensional one-electron Schrödinger equations.


6. CONCLUSIONS 


Among the various types of basis functions used in the study of solids 
ought to be included the solid harmonics. It would be of considerable interest 
to have a method of generating solid harmonics less laborious than the Schmidt 
process, although perhaps an adequate number for many purposes can be 
obtained in this way. The spherical cell approximation seems to be fairly
good for s and $d^{1}$ type functions in the body-centered cubic lattice unit
cell. It seems to be quite poor for functions of p or $d^{2}$ symmetry in the
above cell or for functions of s, p, $d^{1}$, or $d^{2}$ symmetry in the unit
cell of the face-centered cubic lattice. Nothing can be concluded about the
spherical approximation for functions of any of the other six symmetry types in
either lattice.

The use of solid harmonics promises to provide a new method of numerical
integration of smoothly varying functions of cubic symmetry which should
require far fewer points than the usual straightforward method of numerical
integration for the same degree of accuracy. For integrands with
discontinuities, however, the usual method with its "unprejudiced" weights
should be better.

APPENDIX
Normalized integrals of polynomials

                          Region of integration
Polynomial      Sphere      Truncube            Dodecahedron
1               1           1                   1
r^2             3/5         19/32               3/8
x^4             3/35        89/960              3/80
x^2 y^2         1/35        71/2880             7/720
r^4             3/7         409/960             41/240
x^6             1/21        1237/21,504         83/5376
x^4 y^2         1/105       1/120               31/16,128
x^2 y^2 z^2     1/315       457/276,480         37/48,384
r^6             1/3         6703/20,160         463/5376
x^8             1/33        377/9216            31/3840
x^6 y^2         1/231       2857/645,120        1/1792
x^4 y^4         1/385       743/645,120         37/134,400
x^4 y^2 z^2     1/1155      2573/6,451,200      53/403,200
r^8             3/11        38,281/134,400      2119/44,800





THE PRESSURE-INDUCED ROTATIONAL ABSORPTION 
SPECTRUM OF HYDROGEN 


I. A STUDY OF THE ABSORPTION INTENSITIES¹
Z. J. Kiss,² H. P. Gush,³ AND H. L. WELSH


ABSTRACT 

The pressure-induced infrared spectrum of H₂ and mixtures of H₂ with N₂, He,
Ne, A, Kr, and Xe was measured in the region 300-1400 cm⁻¹ at total gas
pressures up to 250 atm at 300° K and, where possible, at 195° K and 85° K.
The spectrum shows greatly broadened S lines (ΔJ = +2) with half widths
which decrease as the temperature is lowered. The integrated absorption
coefficient of the band is of the form $\alpha_1\rho_{H_2}\rho_p$, where
$\rho_p$ is the density of the perturbing gas, except in the case of Xe for
which a cubic term, $\alpha_2\rho_{H_2}\rho_p^{2}$, is also necessary.
The binary coefficient $\alpha_1$ increases by a factor of 28 in going from He to Xe.
The theoretical band intensity, calculated on the basis of quadrupole inter- 
action alone, is in good agreement with the experimental value only for Xe as 
perturbing gas; in other cases the calculated value is appreciably less than the 
observed value. The shape of the absorption contours suggests that the S lines 
are overlaid by a continuum increasing in intensity towards lower frequencies. 
This continuum is interpreted as the counterpart of the Qg component in the 
vibrational band, that is, a collision-induced absorption due to overlap inter- 
action in which the relative kinetic energy of the collision partners changes in 
the absorption process. 


INTRODUCTION 


The infrared vibrational spectrum of hydrogen, studied under widely 
different experimental conditions, has served to delineate many features of 
collision-induced spectra (Chisholm and Welsh 1954; Hare and Welsh 1958). 
A simple picture of the origin of these spectra is the following: The inter- 
molecular forces operative during collisions produce a distortion of the electron 
distributions of the molecules. The resulting electric dipole moment of the 
system is modulated by the vibration and rotation of the colliding molecules, 
so that molecular frequencies, which are inactive because of symmetry in the 
free molecule, appear in infrared absorption. In the region of densities in 
which only binary collisions are important, the intensity of the induced 
absorption is proportional to the square of the density of the pure gas; the 
enhancement of the absorption by a foreign gas is proportional to the product 
of the densities of the absorbing gas and the perturbing gas. As shown by the 
structure of the fundamental vibrational band of hydrogen the selection rule
for rotational transitions in homonuclear diatomic molecules is ΔJ = 0, ±2,
where J is the total angular momentum quantum number; this selection rule
is a consequence of the fact that these molecules have an electrical center of 
symmetry which coincides with the center of gravity. 


¹Manuscript received December 5, 1958.

Contribution from the McLennan Laboratory, University of Toronto, Toronto, Ontario.
This research was supported by Grant No. T54 of the National Research Council of Canada.

²Holder of a scholarship from the National Research Council, 1957-58.

³Holder of a scholarship from the National Research Council, 1955-57. Present address:
Laboratoire Aimé Cotton, C.N.R.S., Bellevue (S. et O.), France.


Can. J. Phys. Vol. 37 (1959) 


The theory of the induced fundamental band of hydrogen has been given 
by Van Kranendonk and Bird (1951), Van Kranendonk (1952, 1957, 1958), 
and Britton and Crawford (1958), who have shown that the absorption must 
be explained in terms of electron overlap and molecular quadrupole inter- 
actions. The overlap interaction appears to be almost spherically symmetric 
and thus produces mainly transitions for which ΔJ = 0 (Q branch); the
quadrupole interaction on the other hand is strongly dependent on the mutual
orientation of the molecules in a pair and produces transitions for which
ΔJ = ±2 (S and O branches) and, in addition, a small contribution to the
Q branch intensity. 

Any homonuclear diatomic molecule should also show an induced spectrum 
in which purely rotational transitions are involved. However, only in the 
case of hydrogen would this spectrum lie in a region accessible to the more 
conventional infrared techniques. The Raman spectrum of hydrogen at room 
temperature shows four rotational lines, S(J) with J = 0, 1, 2, 3, at 354.4, 
587.1, 814.4, and 1034.7 cm⁻¹, respectively (Stoicheff 1957); in the induced
infrared spectrum these lines are all in the region of CsBr optics. From the 
theory of the vibrational band one would expect that the rotational lines of 
hydrogen would appear in absorption mainly as a result of quadrupole inter- 
action; it is therefore possible that, under certain conditions, the value of the 
quadrupole moment of the hydrogen molecule could be calculated from the 
intensity of the absorption. 

The S(2) and S(3) rotational lines were observed by Ketelaar, Colpa, and 
Hooge (1955), and in a recent paper Colpa and Ketelaar (1958) have given 
data for the S(1), S(2), and S(3) lines in pure hydrogen and in mixtures 
with helium, argon, nitrogen, and carbon dioxide at —60°, 25°, and 80° C. 
The present investigation was designed to obtain experimental data for the 
whole rotational spectrum in the pure gas and in mixtures with nitrogen, 
helium, neon, argon, krypton, and xenon at room temperature. The heavier 
rare gases were included since it seemed probable that for these the quadrupole 
effect would be relatively much larger than the overlap effect, as compared 
with perturbing molecules with smaller polarizabilities. Experiments on the 
vibrational band of hydrogen have shown that many features of the spectrum 
become clearer at low temperatures; the investigation of the rotational spec- 
trum was therefore extended whenever possible to 195° K and 85° K. In the 
present paper the data on the integrated absorption intensities are discussed;
in a future communication an analysis of the absorption profiles will be given. 


EXPERIMENTAL 


To record the whole rotational spectrum of hydrogen it was necessary to 
use windows of caesium bromide in the pressure tube. Since caesium bromide 
is relatively soft some difficulties were experienced from the plastic flow of 
the crystal under pressure. In the design finally adopted circular windows, 
25 mm in diameter and 13 mm thick, were used with a free area of 10 mm
× 5 mm; the rectangular free area was used in preference to a circular area
of the same size, since the shear strength for the former is greater than for
the latter as a consequence of the greater shear area. The deformation of the
window under pressure was studied by placing a glass rod connected to an
optical lever in contact with the outside surface of the window. The amount
of extrusion as a function of the pressure is shown in Fig. 1 (a), the pressure
being applied in each case for 10 minutes. The extrusion as a function of time
is shown in Fig. 1 (b) for two different pressures. It was noted that the plastic
flow takes place in a preferential direction in the crystal, so that additional
strength in the window might be obtained by a proper orientation of the
crystal axes; however, no attempt was made to do this in this investigation.
As a result of these experiments it was concluded that the windows used could
support a pressure of 300 atm for an extended period without serious
deformation. It was necessary, however, to renew the windows frequently since
they had a tendency to crack when the pressure was released.

FIG. 1. Measurements of the extrusion of CsBr windows under pressure: (a) amount of
extrusion as a function of pressure applied for 10-minute intervals, (b) extrusion as a function
of time.

The high pressure absorption cell (Fig. 2) was of the transmission type
with a path length of 26.7 cm and was designed to withstand pressures up
to 500 atm. The body of the cell, A, was a stainless steel cylinder with a
central bore, 1.2 cm in diameter, into which was inserted a light guide, B,
with a rectangular cross section, 7 mm × 2.5 mm. The apertures of the cell
were designed so that f/4 cones of radiation could be used to focus the source
and the spectrometer slit on the entrance and exit ends of the light guide
respectively. The window, C, was carried by the hardened steel ring, D, to
which it was attached by a thin layer of rubber cement. The ring, D, was
sealed to the body of the cell with pressure applied by the closing nut, E, to
the lead gasket, F. The incorporation of a tightly fitting invar ring, G, in the
design maintained the pressure seal at temperatures down to 80° K. In the
low temperature experiments the tube was immersed in a trough containing
liquid air or solid carbon dioxide and acetone, and insulated with styrofoam;
CsBr lenses attached to the ends of the absorption cell by evacuated monel
tubes were used as condensers.

FIG. 2. Design of the absorption cell for high pressures and low temperatures.

Since some of the absorptions were very weak in the pressure range which 
could be used, a second absorption cell with a path length of 245 cm was also 
employed. The design of the end seals of this tube was the same as that 
described above, and a silver wave guide served as the light guide. 

The spectra were recorded with a Perkin—Elmer 12C spectrometer equipped 
with a CsBr window and a thermocouple detector. In preliminary experiments 
with the cell at liquid air temperature and the chopper in its normal position 
in front of the spectrometer slit, it was noted that a negative signal was 
recorded in regions of low absorption. It was apparent that the thermocouple 
was radiating energy to the cold cell and that this chopped back-radiation 
was responsible for the negative signal. The effect was eliminated by placing 
the chopper between the source and the cell. In the experiments at room 
temperature the source was a Globar operated at 4 amp from a Sorenson 
voltage regulator. Better results were obtained in the later experiments at 
low temperatures using a commercial Welsbach mantle heated by an oxygen-gas
flame. Since the intensity of both of these sources falls off rapidly at long
wave lengths it was found necessary to alter the slit width of the spectrometer
and the gain of the amplifier for each of the four regions 5-12 μ, 12-18 μ,
18-26 μ, and 26-33 μ; however, an effective resolution of at least 5 cm⁻¹ was
preserved throughout the range. To reduce the effects of stray radiation the
metal plate of the beam chopper was replaced by a fused quartz plate so that
only radiation of wave length greater than 4 μ was detected. With this device
the effect of stray radiation was negligible except at wave lengths greater
than 25 μ; for this region the line of infinite absorption on the recorder trace
was obtained by covering the slit of the spectrometer with a thick plate of
rock salt. The frequency calibration was obtained from the atmospheric
water vapor and carbon dioxide bands in the region 5-9 and 23-33 μ,
supplemented in the region 9-11 μ by the spectrum of ammonia gas at a
pressure of 10 mm Hg in an absorption cell 10 cm long.

The gases were compressed with a hand-operated 1500-atm gas compressor,
and the pressure in the cell was measured with a calibrated Bourdon tube
gauge. To eliminate water vapor absorption the gases were dried by passage
through a cold trap. Mixtures of gases were made at lower pressures in the
chamber of the gas compressor and it was assumed that the density of each gas
could be calculated from its partial pressure. The densities of the gases were
calculated from the measured pressures and temperatures; isothermal data for
all the gases are available in the literature for the temperature and pressure
ranges used.

In experiments with compressed hydrogen gas at lower temperatures 
ortho—para conversion, catalyzed by the steel of the absorption cell, takes 
place slowly. Since it is essential to know the ortho-para ratio in calculating 
the absorption coefficients, the experimental data were obtained under con- 
ditions for which there was no conversion (normal hydrogen) or for which 
there was complete conversion (equilibrium hydrogen for the temperature in 
question). To obtain normal hydrogen at lower temperatures a leak was 
introduced in the absorption cell so that the contents of the cell were renewed 
from the chamber of the gas compressor at room temperature every 3 minutes; 
control experiments showed that in this period of time the ortho-para con- 
version was negligible. An equilibrium ortho-para mixture at 85° K was 
prepared by passing hydrogen through a copper coil filled with alumina gel 
coated with chromic oxide and cooled to 85° K; some of the catalyst was 
also placed in the high pressure cell itself. The conversion was followed by 
recording the signal level at the position of the S(1) line and appeared to be 
complete after 3 hours. 


EXPERIMENTAL RESULTS 
Table I summarizes the experimental conditions for which one or more sets
of absorption profiles of the rotational band were obtained. Each experiment,
for a given perturbing gas and a given temperature, consisted in recording
the spectrum from approximately 300 cm⁻¹, the low frequency limit of
observation, to 1400 cm⁻¹ for a series of six or more different pressures.

TABLE I
Outline of the experiments

                   Temperature,   Optical path   Maximum
Perturbing gas     °K             length, cm     density, Amgt
Normal H₂          300            27             200
                   300            245            45
                   195            27             100
                   85             27             120
Equilibrium H₂     85             27             120
He                 300            27             150
                   300            245            70
                   195            27             250
Ne                 300            27             80
                   300            245            50
                   195            27             80
A                  300            27             120
                   195            27             120
N₂                 300            27             120
                   195            27             120
Kr                 300            27             75
Xe                 300            27             70

Examples of the absorption profiles ($\log_{10}(I_0/I)$ vs. frequency in cm⁻¹)
are shown in Figs. 3, 4, 5, and 6. In these graphs the frequencies of the Raman
lines of gaseous hydrogen are marked on the abscissa axis. In the band for
normal hydrogen at room temperature (Fig. 3) the four lines S(0), S(1), S(2),
and S(3) are readily distinguished, although they are very broad and there
is appreciable overlapping. For the H₂-Xe mixture (Fig. 5) the lines are
comparatively sharp and it is evident that the maxima lie close to the Raman
frequencies. In contrast, the H₂-He mixture (Fig. 4) shows a much weaker
over-all absorption and the S lines are poorly defined.

FIG. 3. Absorption of pure hydrogen in the region 300-1400 cm⁻¹ at room temperature.
Path length, 27 cm. The Raman frequencies of the gas are marked on the abscissa axis.

FIG. 4. Absorption of hydrogen-helium mixtures in the region 300-1400 cm⁻¹ at room
temperature. Path length, 245 cm.

FIG. 5. Absorption of hydrogen-xenon mixtures in the region 300-1400 cm⁻¹ at room
temperature. Path length, 27 cm.

FIG. 6. Comparison of the absorptions of normal hydrogen at 298° K, 195° K, and 85° K
and equilibrium hydrogen at 85° K. Path length, 27 cm.


The dependence of the absorption of pure hydrogen on temperature and 
ortho—para ratio is shown in Fig. 6. The half width of the lines decreases 
markedly with temperature as a consequence of the longer duration of collisions 
at lower temperatures. Only the S(0) and S(1) lines are recorded at 85° K
since the higher rotational levels are not appreciably populated. The greater 
intensity ratio, S(0)/S(1), for the equilibrium mixture at 85° K is due to the 
greater para concentration, 45.8%, as compared with 25% for normal hydrogen. 
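
The para concentrations quoted here can be checked roughly from the Boltzmann populations of the rotational levels. The sketch below builds the level energies from the Raman S(J) frequencies given in the Introduction. The energy of the J = 1 level (118.5 cm⁻¹) and the value of k/hc are assumptions not taken from the text, and the function names are illustrative.

```python
import math

K_CM = 0.6950          # Boltzmann constant divided by hc, cm^-1 per K
RAMAN_S = [354.4, 587.1, 814.4, 1034.7]   # S(0)..S(3) from the text, cm^-1
E1_ASSUMED = 118.5     # energy of the J = 1 level, cm^-1 (assumed)


def level_energies():
    """Rotational energies E(0)..E(5) in cm^-1, using S(J) = E(J+2) - E(J)."""
    energies = [0.0, E1_ASSUMED]
    for j, line in enumerate(RAMAN_S):
        energies.append(energies[j] + line)
    return energies


def equilibrium_para_fraction(temp_k):
    """Fraction of para-H2 (even J) in thermal equilibrium at temp_k."""
    kt = K_CM * temp_k
    z_para = 0.0
    z_ortho = 0.0
    for j, energy in enumerate(level_energies()):
        weight = (2 * j + 1) * math.exp(-energy / kt)
        if j % 2 == 0:
            z_para += weight          # nuclear-spin weight 1
        else:
            z_ortho += 3.0 * weight   # nuclear-spin weight 3
    return z_para / (z_para + z_ortho)


if __name__ == "__main__":
    for temp in (85.0, 195.0, 300.0):
        print(temp, round(equilibrium_para_fraction(temp), 3))
```

With these inputs the fraction at 85° K comes out close to the quoted figure and approaches the normal-hydrogen value of 25% at room temperature; small differences reflect the assumed constants and the truncation at J = 5.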

Integrated absorption intensities, $\int\alpha\,d\nu$, were obtained by
multiplying the areas under the absorption profiles by 2.30/l, where l is the
path length. Since a small portion of the band at frequencies less than
300 cm⁻¹ was not recorded, it was necessary to extrapolate the experimental
curves by assuming a shape for the low frequency wing of the S(0) line similar
to that observed for the S(1) line; the correction is quite small so that this
procedure can be considered adequate. In Fig. 7 the values of
$(1/\rho_{H_2})\int\alpha\,d\nu$ at 300° K are plotted against the density of
the perturbing gas in Amagat units. In the case of gas mixtures
$\int\alpha\,d\nu$ refers to the enhancement by the foreign gas and is obtained
by subtracting from the total intensity the intensity due to H₂-H₂ collisions,
which can be calculated from the data for the pure gas. In all cases except
the H₂-Xe mixture, $(1/\rho_{H_2})\int\alpha\,d\nu$ is a linear function of
$\rho_p$, so that the integrated intensity has the form
$\int\alpha\,d\nu = \alpha_1\rho_{H_2}^{2}$ for the pure gas, and
$\int\alpha\,d\nu = \alpha_1\rho_{H_2}\rho_p$ for the mixtures. The constant,
$\alpha_1$, is thus a specific absorption coefficient for binary collisions and
can be conveniently expressed in the units, cm⁻² Amgt⁻², using the Amagat unit
of density.

FIG. 7. The variation of the integrated absorption intensity of hydrogen in the region
300-1400 cm⁻¹ with the density, $\rho_p$, of the perturbing gas.
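
The reduction from a recorded profile to a binary coefficient can be sketched as follows. This is an illustration only: the trapezoidal rule stands in for whatever area measurement was actually used, ln 10 is written out where the text rounds to 2.30, and the triangular profile in the usage lines is hypothetical data, not a measurement.

```python
import math


def trapezoid(xs, ys):
    """Area under the piecewise-linear curve through the points (xs, ys)."""
    return sum(0.5 * (ys[i] + ys[i + 1]) * (xs[i + 1] - xs[i])
               for i in range(len(xs) - 1))


def integrated_absorption(freqs_cm, log10_ratio, path_cm):
    """Integral of alpha d(nu), in cm^-2, from a log10(I0/I) profile.

    alpha = ln(10) * log10(I0/I) / l, so the area under the recorded
    profile is multiplied by ln(10)/l (2.30/l in the text).
    """
    return math.log(10.0) * trapezoid(freqs_cm, log10_ratio) / path_cm


def binary_coefficient(integral, rho_h2, rho_perturber=None):
    """alpha_1 in cm^-2 Amgt^-2 from a single integrated intensity.

    Pure gas: divide by rho_h2**2. Mixture (enhancement only):
    divide by rho_h2 * rho_perturber.
    """
    if rho_perturber is None:
        return integral / rho_h2 ** 2
    return integral / (rho_h2 * rho_perturber)


if __name__ == "__main__":
    # Hypothetical triangular profile, for illustration only.
    freqs = [300.0, 600.0, 1400.0]
    profile = [0.0, 0.5, 0.0]
    total = integrated_absorption(freqs, profile, path_cm=27.0)
    print(total, binary_coefficient(total, rho_h2=100.0))
```

In practice the coefficient would be taken from the slope of the plot against density rather than from a single point, which averages out the scatter of individual profiles.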










The graph for the H₂-Xe mixture in Fig. 7 departs noticeably from a
straight line; here the integrated intensity follows the relation,

$$
\int\alpha\,d\nu = \alpha_1\rho_{H_2}\rho_{Xe} + \alpha_2\rho_{H_2}\rho_{Xe}^{2},
$$

where $\alpha_1$ = 1.74 × 10⁻² cm⁻² Amgt⁻² and $\alpha_2$ = −5.5 × 10⁻⁵
cm⁻² Amgt⁻³. The small cubic term shows some effect of ternary collisions of
the type Xe-H₂-Xe. According to general calculations of Van Kranendonk (1957)
this term is given by the superposition of three effects: (a) the density
dependence of the distribution function for pairs of molecules (finite volume
effect), (b) the partial cancellation of the two moments induced in a ternary
collision of the above type, and (c) non-additivity of the dipoles induced in
ternary collisions. The cancellation effect (b) leads in general to a negative
cubic term and is probably chiefly responsible for the cubic coefficient
observed for the H₂-Xe mixture. The appearance of the cubic term in the single
case of the H₂-Xe mixture may be due to the fact that the experiment was
carried out close to the critical point (16° C) of xenon; it may therefore be
indicative of molecular clustering in the neighborhood of the critical point.
This hypothesis could be readily tested by extending the experiments to higher
temperatures.
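
A relation with both a binary and a cubic term can be fitted to the reduced intensities by least squares with no constant term. The sketch below assumes that procedure; the paper does not say how its coefficients were obtained. The data in the usage lines are synthetic, generated from illustrative coefficients, and `fit_binary_and_ternary` is a hypothetical name.

```python
def fit_binary_and_ternary(rho_p, reduced_intensity):
    """Least-squares fit of y = a1*rho + a2*rho**2 (no constant term).

    reduced_intensity holds (1/rho_H2) * integral(alpha d nu) at each
    perturbing-gas density rho_p, in Amagat units.
    """
    s2 = sum(r ** 2 for r in rho_p)
    s3 = sum(r ** 3 for r in rho_p)
    s4 = sum(r ** 4 for r in rho_p)
    t1 = sum(r * y for r, y in zip(rho_p, reduced_intensity))
    t2 = sum(r ** 2 * y for r, y in zip(rho_p, reduced_intensity))
    # Normal equations: [[s2, s3], [s3, s4]] @ [a1, a2] = [t1, t2]
    det = s2 * s4 - s3 * s3
    a1 = (t1 * s4 - t2 * s3) / det
    a2 = (s2 * t2 - s3 * t1) / det
    return a1, a2


if __name__ == "__main__":
    # Synthetic points from illustrative coefficients (not measured data).
    densities = [10.0, 20.0, 30.0, 40.0, 50.0, 60.0, 70.0]
    values = [1.74e-2 * r - 5.5e-5 * r ** 2 for r in densities]
    print(fit_binary_and_ternary(densities, values))
```

A negative fitted second coefficient shows up as downward curvature in a plot of the reduced intensity against density.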
The effect of lower temperatures on the integrated intensity is illustrated
in Fig. 8 where the results for normal hydrogen at 300°, 195°, and 85° K are
given. The decrease in the value of $\alpha_1$ as the temperature is lowered is
qualitatively explained by the greater distance of closest approach of the
molecules in collision, giving a smaller average interaction. Figure 8 also
shows that increasing the para concentration at 85° K gives a slight decrease
in $\alpha_1$.

FIG. 8. The effect of temperature and ortho-para ratio on the integrated absorption
intensity of hydrogen.

FIG. 9. Variation of the experimental values of $\tilde\alpha_1$ and calculated values of
$\tilde\alpha_1$(quad.) with the molecular polarizability, α, of the perturbing gas.


For comparison with theory it is convenient to introduce a different measure
of the intensity, $\tilde\alpha$, in which the effects of (a) the "apparent"
induced absorption and (b) the varying absorption frequency, ν, have been
eliminated. Apparent induced absorption is the slight enhancement of any
absorption which results from the long-range dipole-dipole interaction between
molecules. According to Van Kranendonk (1957) experimental intensities can be
corrected for apparent induced absorption by multiplication with the factor,

$$
\left(1+\tfrac{8}{3}\pi\bar\alpha n\right)^{1/2}
\left(1-\tfrac{4}{3}\pi\bar\alpha n\right)^{3/2},
$$

where $\bar\alpha$ is the average electronic polarizability of the molecule and
n is the number of molecules per cubic centimeter. In the most extreme case
considered here this correction is of the order of 1%. The experimental
absorption intensity contains the usual frequency factor, ν, the value of which
varies over a wide range in the case of the rotational band. It is expedient to
remove this frequency dependence by means of an average absorption frequency,
$\bar\nu$, defined by the equation,

$$
\bar\nu = \int\alpha\,d\nu \Big/ \int(\alpha/\nu)\,d\nu.
$$

The frequency-corrected binary coefficients,
$\tilde\alpha_1 = \alpha_1/\bar\nu$, are thus in the nature of transition
probabilities; they will be expressed here in the form, cycles per second per
molecule per cm³ of hydrogen per molecule per cm³ of perturbing gas, and have
therefore the dimensions sec⁻¹ cm⁶. The essential results of the reduction of
the experimental data are thus $\alpha_1$, $\bar\nu$, and $\tilde\alpha_1$; the
values of these are given in columns 3, 4, and 5 of Table III below.
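
The two reduction steps, the average frequency and the change of units, can be sketched numerically. The unit conversion is an assumption made here for illustration: the coefficient in cm⁻² Amgt⁻² is divided by the average frequency, multiplied by the speed of light to give cycles per second, and divided by the square of the number density at 1 Amagat. The constants and function names are not taken from the paper.

```python
C_LIGHT = 2.998e10      # speed of light, cm/s (assumed value)
LOSCHMIDT = 2.687e19    # molecules per cm^3 at 1 Amagat (assumed value)


def trapezoid(xs, ys):
    """Area under the piecewise-linear curve through the points (xs, ys)."""
    return sum(0.5 * (ys[i] + ys[i + 1]) * (xs[i + 1] - xs[i])
               for i in range(len(xs) - 1))


def mean_frequency(freqs_cm, alpha):
    """Average frequency: integral(alpha dnu) / integral((alpha/nu) dnu)."""
    weighted = [a / v for a, v in zip(alpha, freqs_cm)]
    return trapezoid(freqs_cm, alpha) / trapezoid(freqs_cm, weighted)


def frequency_corrected(alpha1, nu_bar):
    """Convert alpha_1 (cm^-2 Amgt^-2) to sec^-1 cm^6 under the stated assumption."""
    return alpha1 * C_LIGHT / (nu_bar * LOSCHMIDT ** 2)


if __name__ == "__main__":
    # Entries for normal hydrogen at 300 K from the first row of Table III.
    print(frequency_corrected(2.06e-3, 628.0))
```

Applied to the first row of Table III this gives a value of order 10⁻³⁴ sec⁻¹ cm⁶, in line with the mantissa tabulated there.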


CALCULATION OF THE BINARY ABSORPTION COEFFICIENTS DUE 
TO QUADRUPOLE INTERACTION 

As shown by Van Kranendonk (1957, 1958) overlap and quadrupole inter- 
actions are together responsible for the intensity of the fundamental vibration 
band of hydrogen. Although both types of interaction should contribute to 
the intensity of the rotational band, it might be expected that in some cases 
the overlap effect would be relatively much smaller than the quadrupole 
effect; the overlap intensity is, moreover, difficult to calculate. On the other 
hand, the intensity of purely rotational transitions due to quadrupole inter- 
action is easily deducible from the theory of Van Kranendonk; such calcula- 
tions have been carried out by Colpa (1957) and Kiss (1957). In the following 
an outline of the theory will be presented. 

The binary absorption coefficient of the pure gas, $\tilde\alpha_1$, defined by
the equation,

$$
\int(\alpha/\nu)\,d\nu = \int\tilde\alpha\,d\nu = \tilde\alpha_1 n_{H_2}^{2},
$$

is given by the general expression,

$$
\tilde\alpha_1 = \frac{8\pi^{3}}{3h}\sum_{i,f} P_i \int |\mu_{if}(\mathbf{R})|^{2}\, g(R)\, d\mathbf{R}. \tag{1}
$$

Here $P_i$ indicates the normalized Boltzmann factor for the initial state, i;
$\mu_{if}(\mathbf{R})$ is the matrix element of the dipole transition,
i → f, and is dependent on the intermolecular distance, R; g(R) is the pair
distribution function. For purely quadrupolar interaction $\mu_{if}$ has the
form

$$
\mu_{if} = \iint \psi^{*}_{J_1 m_1}\psi^{*}_{J_2 m_2}\,\mu(\omega_1,\omega_2)\,
\psi_{J_1' m_1'}\psi_{J_2' m_2'}\, d\omega_1\, d\omega_2 \tag{2}
$$

where $\mu(\omega_1, \omega_2)$, the resultant dipole moment induced by the
molecular quadrupoles, is a function of the orientations, $\omega_1$ and
$\omega_2$, of the molecules 1 and 2, and $\psi_{Jm}$ is the eigenfunction of
the rotational state J with the magnetic quantum number m. The dipole moment
induced by the quadrupole moment, $Q_1$, of molecule 1 in molecule 2, which has
the polarizability, $\alpha_2$, is

$$
(\alpha_2 Q_1/R^{4})\, f(\omega_1),
$$

where $f(\omega_1)$ is a function of the orientation of molecule 1, and
$\alpha_2$ is assumed to be spherically symmetric. There is also a dipole
moment of similar form induced in molecule 1 by the quadrupole moment of
molecule 2.

When μ is expanded in a series of spherical harmonics and a summation
is carried out over the substates, m, equation (1) becomes

$$
\tilde\alpha_1^{\,rot}(J) = \frac{48\pi^{4}}{h}\,\alpha_{H_2}^{2} Q_{H_2}^{2}\,
\{P(J)-P(J+2)\}\,\frac{(J+1)(J+2)}{(2J+3)}
\int_0^{\infty} e^{-V_0(R)/kT}\,\frac{dR}{R^{6}} \tag{3}
$$

for the rotational transition, J → J+2. P(J) is the probability of finding the
molecule in the state, J, so that $\sum_J (2J+1)P(J) = 1$; the term P(J+2)
takes account of induced emission. $V_0(R)$ is the Lennard-Jones intermolecular
potential function,

$$
V_0(R) = 4\epsilon\left\{(\sigma/R)^{12} - (\sigma/R)^{6}\right\}.
$$

The total absorption coefficient is
$\tilde\alpha_1(\text{quad.}) = \sum_J \tilde\alpha_1^{\,rot}(J)$, where
J = 0, 1, 2, and 3.

For the enhancement of the rotational absorption by foreign gases equation
(3) can be suitably modified. For the intermolecular potential function of
two molecular types, a and b, the usual approximations were used:

$$
\epsilon_{ab} = \sqrt{\epsilon_a\epsilon_b}, \qquad \sigma_{ab} = (1/2)(\sigma_a+\sigma_b).
$$

The values of the integral,

$$
I = \int_0^{\infty} e^{-V_0(R)/kT}\,\frac{dR}{R^{6}}, \tag{4}
$$

were obtained by numerical integration. For the quadrupole moment of
hydrogen the theoretical value of James and Coolidge (1938),
$Q_{H_2} = 0.45\,e a_0^{2}$, was used. Values of ε/k and σ of the pure gases
were taken from the summary of Hirschfelder, Curtiss, and Bird (1954). Since
polarizabilities are frequency-dependent, values for use in the calculations
were determined from refractive indices at 400 cm⁻¹, obtained by extrapolation
of refractive index data available in the literature for the visible and near
infrared regions; for the gases used it was found that a plot of 1/(n−1) vs. ν
gave a well-defined straight line. The values of the constants used in the
calculations, along with the values of I, are summarized in Table II; values of
$\tilde\alpha_1$(quad.) are given in column 6 of Table III.

TABLE II
Data used in the calculations

                           Potential constants for
                           the H₂ - foreign gas pair
Perturbing   α × 10²⁴,     ε/k,      σ,       I(300° K) × 10⁻³⁷,  I(195° K) × 10⁻³⁷,  I(85° K) × 10⁻³⁷,
gas          cm³           °K        Å        cm⁻⁵                cm⁻⁵                cm⁻⁵
H₂           0.806         37.0      2.92     1.69                1.46                1.38
N₂           1.75          59.4      3.32     0.862               0.831               -
He           0.205         14.9      2.46     2.56                2.13                -
Ne           0.396         36.0      2.86     1.83                1.61                -
A            1.65          66.5      3.12     1.06                1.04                -
Kr           2.45          70.6      3.26     0.896               -                   -
Xe           4.05          90.5      3.51     0.665               -                   -

TABLE III
Experimental and calculated values of the absorption coefficients

                    Experimental                                   Calculated
T,    Perturbing    α₁ × 10³,       ν̄,      α̃₁ = α₁/ν̄ × 10³⁴,    α̃₁(quad.) × 10³⁴,   α̃₁(exptl.)/    [α̃₁(exptl.) − α̃₁(quad.)] × 10³⁴,
°K    gas           cm⁻² Amgt⁻²     cm⁻¹    sec⁻¹ cm⁶             sec⁻¹ cm⁶            α̃₁(quad.)      sec⁻¹ cm⁶
300   n-H₂          2.06            628     1.37                  1.09                 1.25           0.28
      N₂            5.42            636     3.54                  2.66                 1.33           0.88
      He            0.63            602     0.44                  0.108                4.0            0.33
      Ne            1.24            615     0.836                 0.289                2.89           0.54
      A             6.71            635     4.41                  2.94                 1.50           1.5
      Kr            9.60            641     6.22                  5.50                 1.13           0.72
      Xe            17.4            642     11.1                  11.0                 1.01           0.1
195   n-H₂          1.76            613     1.19                  1.06                 1.12           0.13
      N₂            4.77            608     3.27                  2.85                 1.15           0.42
      He            0.27            590     0.187                 0.0990               1.89           0.09
      Ne            0.82            592     0.475                 0.284                1.67           0.19
      A             5.30            610     3.61                  3.17                 1.14           0.44
85    n-H₂          1.69            562     1.25                  1.11                 1.13           0.14
      e-H₂          1.63            521     1.30                  1.23                 1.06           0.07

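
As a numerical illustration of the calculation outlined above, the sketch below evaluates the Lennard-Jones integral and the quadrupole-induced coefficient for pure normal hydrogen at 300° K from the H₂ entries of Table II. Several inputs are assumptions rather than values given in the text: the numerical prefactor 48π⁴/h (a reading of a partly illegible equation), the J = 1 rotational energy, the physical constants, and the integration limits. All function names are illustrative.

```python
import math

H_PLANCK = 6.626e-27     # erg s (assumed constant)
K_CM = 0.6950            # Boltzmann constant / hc, cm^-1 per K (assumed)
E_CHARGE = 4.803e-10     # esu (assumed constant)
BOHR = 0.5292e-8         # cm (assumed constant)

ALPHA_H2 = 0.806e-24                 # cm^3, Table II
Q_H2 = 0.45 * E_CHARGE * BOHR ** 2   # esu cm^2, from 0.45 e a0^2
EPS_OVER_K = 37.0                    # K, Table II
SIGMA = 2.92e-8                      # cm, Table II
RAMAN_S = [354.4, 587.1, 814.4, 1034.7]   # S(0)..S(3), cm^-1
E1_ASSUMED = 118.5                   # cm^-1, J = 1 level (not in the text)


def lj_integral(temp_k, n=20000):
    """I = integral of exp(-V0(R)/kT) R^-6 dR for a Lennard-Jones V0, cm^-5."""
    t_star = temp_k / EPS_OVER_K
    lo, hi = 0.3, 10.0               # limits in reduced distance x = R/sigma

    def g(x):
        u = x ** -6
        return math.exp(-4.0 * (u * u - u) / t_star) * u

    h = (hi - lo) / n                # composite Simpson rule, n even
    total = g(lo) + g(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(lo + i * h)
    tail = hi ** -5 / 5.0            # beyond hi the Boltzmann factor is ~1
    return (total * h / 3.0 + tail) / SIGMA ** 5


def normal_h2_probabilities(temp_k):
    """P(J) per magnetic substate, J = 0..5, for 1/4 para and 3/4 ortho."""
    energies = [0.0, E1_ASSUMED]
    for j, line in enumerate(RAMAN_S):       # S(J) = E(J+2) - E(J)
        energies.append(energies[j] + line)
    w = [(2 * j + 1) * math.exp(-e / (K_CM * temp_k))
         for j, e in enumerate(energies)]
    z_para, z_ortho = sum(w[0::2]), sum(w[1::2])
    return [(0.75 / z_ortho if j % 2 else 0.25 / z_para) * wj / (2 * j + 1)
            for j, wj in enumerate(w)]


def quad_coefficient(temp_k=300.0):
    """Sum over J = 0..3 of the S(J) contributions, in sec^-1 cm^6."""
    p = normal_h2_probabilities(temp_k)
    rot = sum((p[j] - p[j + 2]) * (j + 1) * (j + 2) / (2 * j + 3)
              for j in range(4))
    prefactor = 48.0 * math.pi ** 4 / H_PLANCK   # assumed reading of eq. (3)
    return prefactor * ALPHA_H2 ** 2 * Q_H2 ** 2 * rot * lj_integral(temp_k)


if __name__ == "__main__":
    print(lj_integral(300.0), quad_coefficient(300.0))
```

Under these assumptions both results should fall near the corresponding H₂ entries at 300° K in Tables II and III; agreement to better than a few per cent is not to be expected from a sketch with rounded constants.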
DISCUSSION 


The agreement of the experimental values of α̃₁ and the calculated values
of α̃₁(quad.) is shown by the values of α̃₁(exptl.)/α̃₁(quad.) and α̃₁(exptl.)
− α̃₁(quad.) given in the last two columns of Table III. In all cases α̃₁(exptl.)
is greater than α̃₁(quad.). The results for 300° K form the most extensive
series for the comparison of the data for the various perturbing molecules.
Here the ratio α̃₁(exptl.)/α̃₁(quad.) ranges from 4.0 to 1.02 and shows for
the rare gases a monotonic dependence on the mass of the perturbing molecule.
It was found that for all of the perturbing molecules α̃₁(quad.) is a
fairly smooth function of the molecular polarizability, α; thus, the graph of
log₁₀ α̃₁(quad.) vs. log₁₀(α²) in Fig. 9 is very nearly a straight line. In this
type of graph the experimental values of α̃₁ also form a smooth curve; however,
this curve approaches that for α̃₁(quad.) only for the heavier perturbing
molecules. It will be noted that, although α̃₁(exptl.) and α̃₁(quad.) vary by
factors of 25 and 100, respectively, α̃₁(exptl.) − α̃₁(quad.) changes by a factor
of only 6 and, in the series of the rare gases, has its greatest value for argon.
There is thus a part of the experimental intensity which is not accounted for
by the quadrupole induction of S transitions; this part of the intensity shows
a smaller variation from molecule to molecule than the quadrupole intensity,
and is important relative to the latter when the polarizability of the
perturbing atom is small.
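
The comparison drawn in this paragraph can be retraced numerically. The following is a minimal sketch, not part of the original analysis: it recomputes the last two columns of Table III at 300° K from the experimental and calculated columns, together with the factors of about 25 and 100 quoted above. The gas labels and the common factor of 10⁻³⁴ are a reading of the printed table; the function names are illustrative only.

```python
# Rows of Table III at 300 deg K: (gas, exptl, quad, listed ratio, listed difference).
# The exptl and quad columns are in units of 1e-34 sec^-1 cm^6, as read from the table.
TABLE_III_300K = [
    ("n-H2", 1.37, 1.09, 1.25, 0.28),
    ("N2", 3.54, 2.66, 1.33, 0.88),
    ("He", 0.44, 0.108, 4.0, 0.33),
    ("Ne", 0.836, 0.289, 2.89, 0.54),
    ("A", 4.41, 2.94, 1.50, 1.5),
    ("Kr", 6.22, 5.50, 1.13, 0.72),
    ("Xe", 11.1, 11.0, 1.01, 0.1),
]


def recompute(rows):
    """Return (gas, ratio, difference) recomputed from the exptl and quad columns."""
    return [(gas, exptl / quad, exptl - quad) for gas, exptl, quad, _, _ in rows]


def spread(values):
    """Largest value divided by smallest value."""
    return max(values) / min(values)


if __name__ == "__main__":
    for (gas, ratio, diff), row in zip(recompute(TABLE_III_300K), TABLE_III_300K):
        print(f"{gas:5s} ratio {ratio:5.2f} (table {row[3]})  "
              f"difference {diff:5.2f} (table {row[4]})")
    print("spread of exptl values:", round(spread([r[1] for r in TABLE_III_300K]), 1))
    print("spread of quad. values:", round(spread([r[2] for r in TABLE_III_300K]), 1))
```

The recomputed entries agree with the listed ones to within the rounding of the table, which suggests that the printed ratios were formed from unrounded values.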

The theoretical and experimental investigations of the vibrational band 
show that in addition to the quadrupole induction of S transitions as calcu- 
lated above, other factors might give rise to induced absorption in the low 
frequency spectrum of hydrogen. These are: 

(1) The anisotropy of the polarizability of the hydrogen molecule. When
the perturbing molecule has a quadrupole moment the anisotropy of the
polarizability of the hydrogen molecule can give rise to some additional
S-line intensity. This effect, which has been calculated by Colpa (1957),
would be present among the gases used here only for pure hydrogen and
hydrogen–nitrogen and would increase the intensity above α̃₁(quad.) by only
a few per cent.

(2) Transitions with ΔJ = 0 induced by quadrupole interaction. Transitions
of this type have been predicted by the theory and are identified with the
Q_Q component of the vibrational band. In the absence of any interaction
with the relative translational motion of the colliding molecules the transition
in the ground vibrational state of the hydrogen molecule would have zero
frequency and would not give rise to absorption. The width of the Q_Q line
shows that there is a relative kinetic energy effect, but it is probable that this
would lead to absorption only at very low frequencies in the far infrared; the
effect would therefore not be observed in the present experiments.

(3) Transitions with ΔJ = 2 due to interaction in the region of overlap
forces. As pointed out above this contribution to the S-line intensity should
be small and a more extended theoretical investigation would be necessary
to assess its importance.

(4) Transitions with ΔJ = 0 induced by the overlap interaction. These
transitions account for most of the intensity in the Q branch of the vibrational
band. Experimentally it is found that the intensity is spread over two wide
components, designated Q_P and Q_R, which have been interpreted as difference
and summation tones of the vibrational frequency of the hydrogen molecule
with a continuum of frequencies due to the relative motions of the molecules
during the collision. If the same effect is present when no vibrational
transition takes place, a continuum corresponding to the Q_R component should
be superimposed on the rotational spectrum. Moreover, if the shape of this
continuum resembles that of the Q_R component, the intensity should decrease
rather rapidly towards higher frequencies in the region of observation, 300
to 1400 cm⁻¹. The physical process involved in the production of the
continuum is the absorption of a photon by two molecules interacting in the
region of overlap forces, the energy of the photon appearing after the collision
as relative translational energy of the two molecules.

The shapes of the absorption profiles suggest that a ‘translational’
absorption of this type actually exists as a continuum underlying the rotational
lines. In Fig. 10 the specific absorption profiles for pure hydrogen and for
the enhancements by the rare gases have been plotted; for greater ease of
comparison the ordinate scale has been adjusted so that the highest intensity,
i.e. the maximum of the S(1) line, has the same height in all cases. It is
evident from these graphs that the shapes of the profiles are entirely
consistent with the assumption of a continuum underlying the S lines and
decreasing rapidly in intensity from 300 to 1400 cm⁻¹. The half width of the
S lines and the intensity of the continuum vary greatly from molecule to
molecule. The characteristic intensity distribution of the continuum affects
particularly the intensity ratio of the maxima of the S lines, J_S(0)^max/J_S(1)^max.
For xenon as the perturbing gas this ratio is 0.21 as compared with the value,
0.17, calculated on the basis of the quadrupole interaction; in this case the
overlap continuum is very weak. For helium, on the other hand, the ratio is
0.82, indicating that the overlap interaction is responsible for a large fraction
of the observed intensity. The other gas mixtures show translational absorption
of intermediate intensity relative to the intensity of the S lines.

FIG. 10. Comparison of the specific absorption profiles of pure hydrogen and of the
enhancement of the absorption in H₂–rare gas mixtures.

Observations on the vibrational band of hydrogen have shown that the
intensity of the Q_R component, as well as its extent towards higher frequencies,
decreases as the temperature is lowered. The intensity of the overlap
continuum in the region of the rotational lines should therefore be less at lower
temperatures and the observed absorption should be more nearly quadrupolar
in origin. Table III shows that the ratio α̃₁(exptl.)/α̃₁(quad.) is in fact more
nearly equal to unity for a given perturbing molecule as the temperature is
lowered.








376 CANADIAN JOURNAL OF PHYSICS. VOL. 37, 1959 


The intensities of the rotational band for normal and equilibrium hydrogen
at 85° K present some points of interest. The integrated absorption
coefficient, α₁, is somewhat greater for normal hydrogen; however, both α̃₁(exptl.)
and α̃₁(quad.) are greater for equilibrium hydrogen. The decrease in α̃₁(exptl.)
− α̃₁(quad.) in going from normal to equilibrium hydrogen indicates a
considerable decrease in the overlap intensity when the ortho-para ratio is
decreased from 3.0 to 1.0. The reason appears to be the fact that para molecules
in the J = 0 state are spherically symmetric; a colliding pair of such molecules
has therefore a center of symmetry and no induced dipole moment is produced.

In conclusion it can be stated with reasonable certainty that the
pressure-induced absorption of hydrogen in the region 300-1400 cm⁻¹ is due to both
quadrupole and overlap interactions. When the perturbing molecule is heavy
the absorption is produced predominantly by the polarization of the heavy 
molecule in the field of the quadrupole moment of the hydrogen molecule. 
For xenon as the perturbing gas the observed intensity is only 2% greater 
than that calculated for the quadrupole effect. This appears to be a striking 
confirmation of the accuracy of the value of the quadrupole moment of 
hydrogen used in the calculations, namely, the theoretical value of James 
and Coolidge. However, it must not be forgotten that the theory of the 
quadrupole effect as presented above uses approximations for the inter- 
molecular potential of two molecular species; although these approximations 
are widely used, their validity is by no means established. Further elucidation 
of the observed absorptions requires the estimation of the overlap con- 
tribution to the S line intensity and a theory of the translational contribution. 
Some progress has already been made in analyzing empirically the observed 
asymmetric shape of the S lines with a view to separating the total intensity 
of the band into rotational and translational components; the details of this 
work will be reserved for a later communication.

GAMMA RADIATION FROM THE U²³⁸(n, γ)U²³⁹ REACTION*


P. J. CAMPION, J. W. KNOWLES, G. MANNING,† AND G. A. BARTHOLOMEW


The γ-rays following thermal neutron bombardment of a target consisting
of 200 g of uranium metal‡ with the U²³⁵ content depleted to approximately
0.034% were examined with a pair spectrometer (Kinsey and Bartholomew
1953) and a flat crystal diffraction spectrometer (Knowles 1959). With the
former instrument, the region between 7.7 and 3.0 Mev was surveyed using a
resolution of about 3%, and studied in more detail between 4.2 and 3.4 Mev
with a resolution of 1%. The energy range from 3.4 to 0.14 Mev was examined
with the crystal spectrometer, Laue diffracting from the (440) planes of a
single germanium crystal, 25 mm thick. With this single crystal arrangement
the resolution was 4% at 1 Mev. The region near 0.6 Mev, which contained a
number of prominent γ-rays, was further studied with a resolution of about
0.5% by a double crystal arrangement, Laue diffracting from the (211) planes
of calcite crystals, each 23 mm thick.

The spectra uncorrected for spectrometer efficiencies and γ-ray absorption
are shown in Fig. 1 for the region above 3.0 Mev and in Fig. 2 for the region
in the neighborhood of 0.6 Mev. The (smoothed) background spectrum, taken
with an empty sample container, is shown as an emphasized broken line in
Fig. 1. The spectrum in this figure was obtained with a constant line width
of 150 kev, while that in the inset was taken with a line width of about 40 kev.
At this resolution the γ-rays at 3.576 and 3.662 Mev are clearly separated.
From the shape of the peak at 4.062 Mev under both resolutions it appears
that there may be some significant unresolved radiation in the low energy
tail of this peak. The energies and absolute intensities of the resolved γ-rays
are given in Table I. The absolute intensities for γ-rays above 3.4 Mev were
determined by the nickel comparison method (Kinsey and Bartholomew
1953). For radiations below 3.4 Mev the absolute intensities were found by
direct comparison with the γ-rays of known intensity following the β-decay of
Np²³⁹, which, after several days' irradiation, is in equilibrium with the (n, γ)
reaction. The accuracy of the absolute intensities quoted in Table I is between
30 and 50%. The spectrum corrected for spectrometer efficiencies and γ-ray
absorption is shown in Fig. 3. In this figure the ordinate is expressed in units
of energy radiated per capture per Mev, and the cross-hatched area indicates
the uncertainty in the absolute intensity measurements. The γ-rays following
the β-decay of Np²³⁹ are well established (Hollander et al. 1956; Ewan et al.
1957) and these γ-rays have been subtracted from the curve of Fig. 3. The
contribution to the remaining spectrum from prompt fission γ-rays from the
residual U²³⁵ in the target was estimated from the data of Francis and Gamble
(1955) and is indicated in Fig. 3. Owing to the difficulty in evaluating the
contribution of fission product γ-rays no attempt has been made to portray
the latter; however, current evidence seems to suggest that both the energy
emitted per fission and the distribution in energy of these radiations are
similar to those for the prompt fission γ-rays (Maienschein et al. 1958).

*Issued as A.E.C.L. No. 753.

†Now at Norman Bridge Laboratory of Physics, California Institute of Technology,
Pasadena, California.

‡We are indebted to the United States Atomic Energy Commission for providing this
material.

FIG. 2. The uncorrected γ-ray spectrum between 0.5 and 0.65 Mev following neutron
bombardment of a target of separated U²³⁸. The measurements were made with the double flat
crystal diffraction spectrometer operating at a resolution of 0.5%. The γ-ray energies quoted
are measured relative to the 0.51094 Mev annihilation radiation.

FIG. 3. The smoothed γ-ray spectrum following neutron bombardment of a target of
separated U²³⁸. The spectrum has been corrected for spectrometer efficiencies and γ-ray
absorption. The γ-rays associated with the β-decay of Np²³⁹ have been subtracted. The
estimated contribution due to prompt fission γ-rays is also shown.


TABLE I
Resolved neutron capture gamma radiation from the U²³⁸(n, γ)U²³⁹ reaction

Energy, Mev          Intensity, γ-rays per capture
0.5401 ± 0.0004      0.09
0.5490 ± 0.0004      0.09
0.5772 ± 0.0004      0.09
0.5885 ± 0.0004      0.05
0.6093 ± 0.0004      0.06
0.6275 ± 0.0004      0.09
0.6375 ± 0.0004      0.03
3.576 ± 0.010        0.02
3.662 ± 0.018        0.02
4.062 ± 0.010        0.07


The energy radiated per capture, $\int_{0.14} E\,\nu(E)\,dE$, in the U²³⁸(n, γ)U²³⁹ reaction
was found to be 4.3 Mev, which, in view of the probable errors in the intensity
determinations, is in reasonable agreement with the neutron binding energy
of U²³⁹. For the latter quantity the value 4.63 ± 0.15 Mev has been obtained
from the U²³⁸(d, p)U²³⁹ reaction (Harvey 1951) and 4.78 Mev from closed decay
energy cycles (Hyde and Seaborg 1957). No γ-rays which could be definitely
ascribed to the U²³⁸(n, γ)U²³⁹ reaction were observed above 4.062 Mev, in
contrast to the results of Kenney and Mattingly (1956). It is estimated that
if uranium capture γ-rays are present in the energy region above 4.062 Mev
their intensities must be less than 0.01 per capture.
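
As a rough illustration of how the energy balance is made up, the discrete analogue of the integral above can be formed for the resolved lines alone. The sketch below is not part of the original analysis; it sums energy times intensity over the entries of Table I and compares the result with the quoted 4.3 Mev per capture. The function and variable names are illustrative, and the intensities carry the 30 to 50% uncertainty stated earlier.

```python
# (energy in Mev, intensity in gamma-rays per capture), copied from Table I.
RESOLVED_LINES = [
    (0.5401, 0.09), (0.5490, 0.09), (0.5772, 0.09), (0.5885, 0.05),
    (0.6093, 0.06), (0.6275, 0.09), (0.6375, 0.03),
    (3.576, 0.02), (3.662, 0.02), (4.062, 0.07),
]

# Total energy radiated per capture quoted in the text.
TOTAL_RADIATED_MEV = 4.3


def energy_in_resolved_lines(lines):
    """Sum of energy times intensity over the resolved lines, in Mev per capture."""
    return sum(energy * intensity for energy, intensity in lines)


if __name__ == "__main__":
    resolved = energy_in_resolved_lines(RESOLVED_LINES)
    print(f"resolved lines carry {resolved:.2f} Mev per capture")
    print(f"fraction of the quoted total: {resolved / TOTAL_RADIATED_MEV:.0%}")
```

On these figures the resolved lines account for well under a quarter of the radiated energy, so most of the 4.3 Mev must lie in the unresolved part of the spectrum.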

An attempt was made to detect γ-rays in coincidence with the 4.062 Mev
radiation. For this experiment a 23-g target of U²³⁸ metal was bombarded by
a beam of thermal neutrons and the capture γ-rays were detected with two
NaI(Tl) scintillation spectrometers (Manning and Bartholomew 1959). A
differential discriminator associated with one of the detectors was set to
accept pulses from the 4.062 Mev line and the spectrum in the other detector
coincident with these pulses was observed with a 100-channel pulse height
analyzer. The resulting spectrum, although not resolved into separate γ-rays,
indicated that the 4.062 Mev radiation is not the ground state transition. The
coincidence spectrum was consistent with a binding energy of about 4.7 Mev.


EWAN, G. T., KNOWLES, J. W., and MACKENZIE, D. R. 1957. Phys. Rev. 108, 1308.

THE EFFECT OF CONCENTRATION ON THE MAGNETIC SUSCEPTIBILITY OF
TRAPPED ELECTRONS AND HOLES IN SEMICONDUCTORS


F. T. HEDGCOCK


In order to avoid the complications and difficulties inherent in the theory 
of impurity band formation in semiconductors, Hedgcock (1955) and Sonder 
and Stevens (1958) have tentatively proposed a simpler model to explain the 
anomalous magnetic susceptibility of certain impurity semiconductors at low 
temperatures. The idea underlying this treatment may perhaps be best ex- 
plained by the following analogy proposed by Mott (1940). We may think 
of an electron bound in an impurity site as similar to an electron bound to a 
hydrogen nucleus but moving in a medium of dielectric constant equal to 
that of the material and with an appropriate effective mass. In practice this 
means that the radius of the smallest Bohr orbit is relatively large (in
germanium about 48 Å). If now the concentration of occupied impurity centers
becomes large enough, some of these orbits will overlap and it seems reasonable
to assume the formation of impurity "molecules" in which two impurity ions
share a pair of electrons in a manner analogous to the hydrogen molecule.
The effect on the susceptibility would then be as follows: instead of the two 
electron spins contributing independently to a Langevin type of paramagnet- 
ism, they will now be aligned antiparallel and produce no resultant spin 
magnetic moment. This type of effect is what is needed to explain why at 
high bound carrier concentrations and at low temperatures the specimens 
show a greater diamagnetism than would be expected on the basis of an 
independent electron spin model. However, we wish now to show that although 
this concept is qualitatively attractive, it is quite inadequate quantitatively. 
For this purpose we require measurements of the magnetic susceptibility of 
semiconductors with suitable impurity concentrations. By "suitable" we mean
(a) of high enough concentrations to have appreciable overlapping of bound
electrons, (b) of concentrations low enough to retain a non-zero activation
energy. Measurements of the low temperature magnetic properties of some
doped germanium samples were made by the author (Hedgcock 1957) but it
was found impossible to satisfy both conditions (a) and (b) in germanium.
In silicon, however, suitable concentrations can be obtained and measurements
on such samples have been made by Sonder and Stevens (1958). From their
measurements these authors deduce the number of electrons bound at low
temperatures (from the measured Hall coefficients) and the number of magnetic
centers at low temperatures from the magnetic susceptibility versus T⁻¹
graphs. These deduced values are listed for two samples of high
concentrations in Table I, columns (1) and (2).
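
The hydrogen-like analogy used in this paragraph can be put in numbers. The sketch below is illustrative only: it uses the standard scaling of the first Bohr radius by the dielectric constant and the effective-mass ratio, and asks what mass ratio is implied by the 48 Å quoted for germanium. The dielectric constant of 16 is an assumption (it is not given in the text), and the function names are hypothetical.

```python
# Radius of the first Bohr orbit in atomic hydrogen, in angstroms.
BOHR_RADIUS_ANGSTROM = 0.529

# ASSUMPTION: dielectric constant of germanium, not stated in the text.
KAPPA_GE = 16.0

# Radius of the smallest impurity orbit in germanium quoted in the text.
RADIUS_GE_ANGSTROM = 48.0


def scaled_bohr_radius(dielectric_constant, mass_ratio):
    """First-orbit radius in angstroms: a0 * kappa / (m*/m)."""
    return BOHR_RADIUS_ANGSTROM * dielectric_constant / mass_ratio


def implied_mass_ratio(dielectric_constant, radius_angstrom):
    """Effective-mass ratio m*/m that reproduces a given orbit radius."""
    return BOHR_RADIUS_ANGSTROM * dielectric_constant / radius_angstrom


if __name__ == "__main__":
    ratio = implied_mass_ratio(KAPPA_GE, RADIUS_GE_ANGSTROM)
    print(f"implied m*/m = {ratio:.3f}")
    print(f"radius recovered = {scaled_bohr_radius(KAPPA_GE, ratio):.1f} angstroms")
```

Under these assumptions an orbit of 48 Å corresponds to an effective mass somewhat under one fifth of the free-electron mass.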

TABLE I

(1)                        (2)                    (3)                    (4)
Charge carrier             No. of bound           No. of overlapping     No. of overlapping
concentration deduced      electrons derived      bound electrons        bound electrons
from Hall coefficient      from low temp. sus.    (experimental)         (calculated)
at high temp.
5 × 10¹⁸                   1.2 × 10¹⁸             3.8 × 10¹⁸             10¹⁸
1.2 × 10¹⁸                 1.1 × 10¹⁸             10¹⁷                   10¹⁶



Column (3), which is the difference between (1) and (2), is interpreted as 
the number of overlapping bound electrons, i.e. the number of bound electrons 
which do not appear to contribute to the resultant magnetic moment. 
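
The subtraction that defines column (3) is easily retraced. The following minimal sketch is not from the original; it takes the entries of columns (1) and (2) with the exponents as read from the printed table (an assumption, since the exponents are poorly legible) and forms their difference. The function name is illustrative.

```python
# (column 1, column 2, column 3) of Table I, per cc, as read from the printed table.
SAMPLES = [
    (5.0e18, 1.2e18, 3.8e18),
    (1.2e18, 1.1e18, 1.0e17),
]


def overlapping_bound_electrons(hall_count, magnetic_count):
    """Column (3): bound electrons that do not show up as magnetic centers."""
    return hall_count - magnetic_count


if __name__ == "__main__":
    for hall, magnetic, listed in SAMPLES:
        derived = overlapping_bound_electrons(hall, magnetic)
        print(f"derived {derived:.1e} per cc, listed {listed:.1e} per cc")
```

With this reading of the exponents both rows of the table are internally consistent.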

We now wish to compare this number (column (3)) with the number of 
overlapping bound electrons which would be calculated on the basis of the 
model outlined above. We assume that when two impurity centers lie within 
a certain critical volume determined by the minimum potential energy of the 
system, a "molecular" impurity site is formed. The separation of the
impurity centers for molecular behavior is found by modifying the energy
expression given for molecular hydrogen (see Glasstone 1949) by including an 
effective mass and a dielectric constant appropriate to the material (here, 
silicon). For a perfectly random binary solid solution we can calculate the 
probability of finding two impurity centers close enough for overlapping to 
occur at any specified concentration. This can be done by following the simple 
statistical theory of solid solutions outlined by Smoluchowski (1951); column 
(4), Table I, shows the result of this calculation.* 
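
The kind of estimate described here can be illustrated with a simpler stand-in. The sketch below does not reproduce the solid-solution statistics of Smoluchowski (1951); it assumes instead that the impurity centers are scattered at random in a continuum, so that the chance of a center having a neighbour within a critical radius follows the Poisson nearest-neighbour law. The critical radius used in the example is a hypothetical value chosen only to exercise the function, not the one derived in the text.

```python
import math


def pair_fraction(concentration_per_cc, critical_radius_cm):
    """Fraction of randomly placed centers with a neighbour inside the critical radius.

    Continuum (Poisson) approximation: 1 - exp(-n * V), where V is the volume
    of a sphere of the critical radius.
    """
    volume = 4.0 / 3.0 * math.pi * critical_radius_cm ** 3
    return 1.0 - math.exp(-concentration_per_cc * volume)


if __name__ == "__main__":
    # HYPOTHETICAL critical radius of 30 angstroms, for illustration only.
    radius_cm = 30e-8
    for concentration in (1.2e18, 5.0e18):
        fraction = pair_fraction(concentration, radius_cm)
        print(f"n = {concentration:.1e} per cc: paired fraction = {fraction:.2f}")
```

Whatever radius is adopted, the paired fraction rises steeply with concentration, which is the qualitative behavior the model relies on.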

It is seen that the agreement between columns (3) and (4), which compare
the experimentally derived and theoretically estimated number of overlapping
bound electrons, is only qualitative. The simple model presented
here is thus clearly inadequate. It seems therefore that to explain the large
decrease in the paramagnetism of trapped charge carriers at low temperatures
one must assume the existence of an impurity band. It should be noted,
however, that attempts to interpret the results on germanium (Hedgcock
1957) on the basis of Mooser's (1955) theoretical treatment of the
susceptibility due to trapped electrons in an impurity band were not conclusive.
Moreover, Sonder and Stevens (1958) were unable to explain their results in silicon
on this basis, although their argument is not altogether convincing. It seems
therefore that Mooser's treatment should be extended to a more general
model of an impurity band; however, Kohn (1957) has emphasized the
difficulties inherent in such a calculation.

*It is perhaps of interest to note that the same calculation shows that complete molecular
behavior of electronic bound states should be observed when the bound electron concentration
reaches 7.5 × 10¹⁸/cc in germanium and 4.3 × 10¹⁹/cc in silicon for n-type materials.


The author wishes to mention the generous support of this research given by 
the National Research Council of Canada and the Defence Research Board. 


GLASSTONE, S. 1949. Theoretical chemistry (D. Van Nostrand Company, Inc.), p. 79.

HEDGCOCK, F. T. 1955. Susceptibility of Ge.Sb alloys. Ph.D. Thesis, University of Western
Ontario, London, Ontario.

——— 1957. J. Electronics, 2, 513.

KOHN, W. 1957. Solid state physics, Vol. 5 (Academic Press, Inc.).

MOOSER, E. 1955. Phys. Rev. 100, 1589.

MOTT, N. F. and GURNEY, R. W. 1940. Electronic processes in ionic crystals (Oxford
University Press), Chap. 3.

SMOLUCHOWSKI, R. 1951. Phys. Rev. 84, 511.

SONDER, E. and STEVENS, D. K. 1958. Phys. Rev. To be published.


RECEIVED DECEMBER 5, 1958.
DEPARTMENT OF PHYSICS,
UNIVERSITY OF OTTAWA,
OTTAWA, ONTARIO.








LETTERS TO THE EDITOR 





Under this heading brief reports of important discoveries in physics may be published. These 
reports should not exceed 600 words and, for any issue, should be submitted not later than six weeks

Impurity molecule-ions and impurity molecules


The possibility of impurity molecule formation has been pointed out by Hedgcock (1959). It
is conceivable that at intermediate impurity concentrations the bound impurity atoms do not
overlap sufficiently to form bands, but that, by pure chance, a small number of impurity atoms
combine to form impurity molecules. There is some experimental evidence
for this point of view (Fletcher et al. 1954; Slichter 1955; Hedgcock 1959). It does not seem
to have been pointed out that one might also expect the formation of impurity molecule-ions
in heavily compensated samples, even when the n- or p-type excess is small.

We are investigating the formation of such impurity states theoretically using the effective 
mass approximation. Although atomic impurity states bear a close resemblance to the states 
of atomic hydrogen, this analogy does not hold for molecular impurity systems. The reason 
is that the "nuclei" of an impurity molecule are held rigidly in place by valence bonds with
neighboring atoms. This fact has two interesting consequences. 

(1) The rotational bands found in ordinary molecular spectra are absent in molecular 
impurity spectra. 

(2) The internuclear distance is not determined by the minimum in the electronic energy 
curve, but is fixed by the impurity concentration. 

Because of (2), the impurity molecules (or impurity molecule-ions) do not have a definite 
internuclear separation, and we expect the ionization energy to be dependent on concen- 
tration. 

The analysis for molecule-ions is somewhat simpler theoretically than the corresponding
calculation for impurity molecules. For a system of N acceptor impurities and N+1 donor
impurities the matrix element of the perturbing potential U in the Bloch representation is:

\[
(n\mathbf{k}|U|n'\mathbf{k}') = \frac{-4\pi e^{2}}{\kappa V(\mathbf{k}-\mathbf{k}')^{2}}
\left[\sum_{n=1}^{N+1} e^{-i(\mathbf{k}-\mathbf{k}')\cdot\mathbf{r}_n^{+}}
- \sum_{n=1}^{N} e^{-i(\mathbf{k}-\mathbf{k}')\cdot\mathbf{r}_n^{-}}\right],
\]

where $\kappa$ is the dielectric constant and $\mathbf{r}_n^{+}$, $\mathbf{r}_n^{-}$ are the position vectors of the donor and acceptor
impurity ions respectively. If we consider only the two positively charged ions (all other
impurity ions being assumed far removed) this matrix element may be shown to become:

\[
(n\mathbf{k}|U|n'\mathbf{k}') = \frac{-8\pi e^{2}\cos[(\mathbf{k}-\mathbf{k}')\cdot\mathbf{a}]}{\kappa V(\mathbf{k}-\mathbf{k}')^{2}},
\]

where $2\mathbf{a}$ is the vector from one impurity ion to the other. The above problem is now
mathematically analogous to the double-well problem with a potential

\[
-\frac{e^{2}}{\kappa}\left[\frac{1}{|\mathbf{r}+\mathbf{a}|}+\frac{1}{|\mathbf{r}-\mathbf{a}|}\right].
\]

If the effective mass is isotropic, an exact solution is possible; otherwise a variational approach
must be adopted.
Detailed results will be published at a later date.
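
The step from the general matrix element to the two-ion form can be sketched as follows. This is an illustration under stated assumptions, not the authors' derivation: the origin is taken at the midpoint of the two donor ions, so that they sit at +a and −a, all other ions are dropped, and q = k − k' is a shorthand introduced here.

```latex
\[
\sum_{\pm} e^{-i\mathbf{q}\cdot(\pm\mathbf{a})}
  = e^{-i\mathbf{q}\cdot\mathbf{a}} + e^{+i\mathbf{q}\cdot\mathbf{a}}
  = 2\cos(\mathbf{q}\cdot\mathbf{a}),
\qquad
\frac{-4\pi e^{2}}{\kappa V q^{2}}\cdot 2\cos(\mathbf{q}\cdot\mathbf{a})
  = \frac{-8\pi e^{2}\cos(\mathbf{q}\cdot\mathbf{a})}{\kappa V q^{2}} .
\]
```

The result is real and even in q, as expected for a pair of equal charges placed symmetrically about the origin, and it is the Fourier transform of the two screened Coulomb wells that make up the double-well potential.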